Supported Models

The Gloo AI platform provides access to a wide range of leading models from multiple providers. Visit the Model Explorer in Gloo Studio for a richer side-by-side comparison including reasoning capabilities, modalities, and speed ratings. The table below is fetched live in your browser from the public, unauthenticated GET /platform/v2/models endpoint — so it always reflects the current platform catalog. Use the Model ID column as the model parameter in your requests.

The rates shown are list prices (each model’s base rate). Usage is billed at the list rate plus a flat platform rate. Track real spend in the Gloo Studio billing dashboard.

Model routing. Gloo AI routes requests to the right provider automatically, with built-in resilience so traffic keeps flowing even if a provider has an issue. Routing is invisible to you and does not change a model’s price — the platform rate stays consistent regardless of how a request is served.

These model IDs work across both the Responses API (/ai/v1/responses — the recommended surface for new integrations) and Completions V2 (/ai/v2/chat/completions). Per-model capability flags (supports_tools, supports_streaming, supports_reasoning, supports_vision) are shown in the Model Capabilities table at the bottom and are also available programmatically on the GET /platform/v2/models response.

Prompt Caching

The Caching column above shows which models support prompt caching, and which type — Implicit (automatic, e.g. OpenAI, DeepSeek, Gemini, Qwen) or Explicit (opt-in per request, e.g. Anthropic). For full details on each provider’s caching mechanism, billing rates, and best practices, see the dedicated Prompt Caching guide.

Billing & PlansManage billing, payment methods, and plan upgrades.

⌘I

​Prompt Caching

Prompt Caching