Many Providers

Beyond language models, Rook’s generation features (image, video, 3D) run through a single typed seam that can carry any model from any vendor — with per-model pricing and per-route capability flags. A new model is a registration, not a release.

| Provider | Lifecycle | Coverage | Billing |
| --- | --- | --- | --- |
| Gemini (direct) | sync | image · text | per-token |
| Veo (direct) | async | video | per-second |
| fal.ai (aggregator) | sync · queue | image · video · 3D | per-MP · per-call |
| Replicate (aggregator) | prediction · poll | image · video · 3D | per-compute-sec |
| Tencent Hunyuan (direct) | async | 3D · full pipeline | per-call · tiered |
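The billing units listed above differ per provider, so a cost estimate has to know which quantity a given price applies to. The following is a minimal sketch of that idea, not Rook's actual pricing code: the names `BillingUnit`, `Price`, `Usage`, and `estimate_cost` are hypothetical, the rates are made-up placeholders, and tiered pricing is deliberately left out.

```python
from dataclasses import dataclass
from enum import Enum


class BillingUnit(Enum):
    """Billing units taken from the provider table; the enum itself is hypothetical."""
    TOKEN = "per-token"
    SECOND = "per-second"
    MEGAPIXEL = "per-MP"
    CALL = "per-call"
    COMPUTE_SECOND = "per-compute-sec"


@dataclass(frozen=True)
class Price:
    unit: BillingUnit
    rate_usd: float  # placeholder rate per unit, not a real vendor price


@dataclass(frozen=True)
class Usage:
    """What one generation consumed; only the field matching the unit is used."""
    tokens: int = 0
    output_seconds: float = 0.0
    megapixels: float = 0.0
    calls: int = 1
    compute_seconds: float = 0.0


def estimate_cost(price: Price, usage: Usage) -> float:
    """Multiply the rate by the quantity that matches the price's billing unit."""
    quantities = {
        BillingUnit.TOKEN: float(usage.tokens),
        BillingUnit.SECOND: usage.output_seconds,
        BillingUnit.MEGAPIXEL: usage.megapixels,
        BillingUnit.CALL: float(usage.calls),
        BillingUnit.COMPUTE_SECOND: usage.compute_seconds,
    }
    return price.rate_usd * quantities[price.unit]


# Example with invented numbers: an 8-second clip billed per second of output.
video_price = Price(BillingUnit.SECOND, rate_usd=0.5)
video_cost = estimate_cost(video_price, Usage(output_seconds=8.0))
```

Keeping the unit on the price rather than on the provider lets an aggregator such as fal.ai carry per-MP and per-call models side by side.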

You don’t have to wade through hundreds of models — Rook curates a working set:

  • Image — Nano Banana (Gemini) · Flux 2 Pro · GPT-Image-2 Edit
  • Video — Veo · Seedance 2.0 · Kling v3
  • 3D — Hunyuan 3D 3.1 · full pipeline

The seam was designed around six realities of working with many vendors — sync vs. queued lifecycles, per-route capabilities, mixed pricing models, per-provider options, URL-or-inline results, and different secret schemes — so adding a model is routine.
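To make the six concerns concrete, here is a rough sketch of what a model registration could look like if each concern were a typed field. Everything in it is an assumption for illustration: `ModelSpec`, `Registry`, `UrlResult`, `InlineResult`, the field names, and the model ids are hypothetical and do not describe Rook's real interface.

```python
from dataclasses import dataclass, field
from typing import Dict, FrozenSet, List, Literal, Union

Lifecycle = Literal["sync", "queue", "poll"]  # sync vs. queued lifecycles


@dataclass(frozen=True)
class UrlResult:
    url: str  # provider returns a link to the generated asset


@dataclass(frozen=True)
class InlineResult:
    data: bytes  # provider returns the asset bytes directly
    mime_type: str


Result = Union[UrlResult, InlineResult]  # URL-or-inline results


@dataclass(frozen=True)
class ModelSpec:
    model_id: str
    provider: str
    lifecycle: Lifecycle
    capabilities: FrozenSet[str]  # per-route capability flags
    billing_unit: str  # e.g. "per-second", "per-call"
    secret_ref: str  # name of the user's own key, never the key itself
    options: Dict[str, object] = field(default_factory=dict)  # per-provider options


class Registry:
    """Holds registered models; adding one is a data change, not a code change."""

    def __init__(self) -> None:
        self._models: Dict[str, ModelSpec] = {}

    def register(self, spec: ModelSpec) -> None:
        if spec.model_id in self._models:
            raise ValueError(f"model already registered: {spec.model_id}")
        self._models[spec.model_id] = spec

    def supporting(self, capability: str) -> List[str]:
        """Return the ids of models whose flags include the capability, sorted."""
        return sorted(
            spec.model_id
            for spec in self._models.values()
            if capability in spec.capabilities
        )


# Hypothetical registrations; the ids and option names are invented.
registry = Registry()
registry.register(ModelSpec(
    model_id="example/image-edit",
    provider="example-aggregator",
    lifecycle="queue",
    capabilities=frozenset({"text-to-image", "edit"}),
    billing_unit="per-call",
    secret_ref="EXAMPLE_AGGREGATOR_KEY",
    options={"seed": 7},
))
registry.register(ModelSpec(
    model_id="example/video",
    provider="example-direct",
    lifecycle="poll",
    capabilities=frozenset({"text-to-video"}),
    billing_unit="per-second",
    secret_ref="EXAMPLE_DIRECT_KEY",
))
```

In this sketch a caller would ask `registry.supporting("edit")` to find eligible models, then dispatch on `lifecycle` and handle either variant of `Result`.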

Bring your own key (BYOK) throughout: one keyring, separate vaults. You use your own provider accounts; Rook just routes requests to them.