Early adopter pricing — first 50 customers lock this rate for life.

Pricing

Predictable pricing. No token markup.

You bring your own API keys. We optimize every call.

All plans support the OpenAI-compatible endpoint (/api/v1/chat/completions).

Free

Forever

Try routing with your own keys. Perfect for testing and small projects.

Starter

For early-stage AI SaaS teams validating model optimization.

$49/ month

Best for teams spending $1k–$5k/month on LLM APIs.

For production AI systems requiring reliability and visibility.

$149/ month

Best for teams spending $5k–$25k/month on LLM APIs.

For high-volume vertical AI platforms and infrastructure teams.

$399/ month

Best for teams spending $25k+/month on LLM APIs.

Starter gives you routing.
Growth gives you control.

Custom pricing

Infrastructure pricing. No surprises.

Do you charge per token?: No. You are billed directly by model providers using your own API keys.
What counts as a routed request?: Each call to the OpenAI-compatible endpoint (/api/v1/chat/completions) or the native API (/api/route or /api/route/stream) counts as one routed request. Fallback retries count as a single request.
Do I need multiple providers connected?: Full optimization works best with at least two providers connected.
Can I force a specific model?: Yes. Use force_model to override routing.
Is routing deterministic?: Yes. The same input, strategy, and constraints will produce the same model selection.
What happens when I hit my monthly request cap?: The API returns 429 with X-RateLimit-Limit, X-RateLimit-Remaining, and X-RateLimit-Reset headers. Your limit resets at the end of the current month (UTC).

Try the Optimizer free. Add your keys and see routing in action.