orcarouter/auto is a named router we
create for every account on signup. It routes each request to the
cheapest live chat model your account has access to, chosen fresh per
request.
Usage
Default behavior
The seed configuration:- Pattern: empty — matches every chat model your account has access to. New models that come online become candidates automatically.
- Strategy:
cheapest— picks the model with the lowest per-token price among live candidates. - Default model: none. If the pattern resolves to no available models, the request fails with a clear error. You can set a default in the dashboard if you want a safety net.
openai/*), swap the
strategy, set a default_model, or delete the router entirely — same
as any named router.
Switching strategies
The dashboard exposes four strategy cards. Cheapest is the seed default above; pick another to change howorcarouter/auto resolves picks:
| Card | Backend enum | What it does |
|---|---|---|
| Cheapest | cheapest | Lowest per-token price among live candidates. The seed default. |
| Quality | quality | Highest quality score among live candidates, regardless of price. |
| Balanced | balanced | Picks a low-cost option that still meets a quality bar; falls back to the highest-quality option if nothing meets the bar. Default for new routers you create yourself. |
| Adaptive | linucb / gated_adaptive | Per-router LinUCB contextual bandit that learns from your real traffic to weigh quality, cost, latency, and reliability per request. Two sub-modes (Standard / Gated) — see Named Routers. |
When to prefer Auto Router over explicit model names
- You don’t want to pin to a specific model; you want the cheapest live chat model at each request.
- You’re prototyping and don’t want to care about which provider is up.
- You want OrcaRouter’s routing to “just work” without thinking about it.
When to prefer explicit model names
- You need deterministic output — picking different models at different times will change generation style and quality.
- You’re using features specific to one model (e.g. Claude’s
cache_control, or a model’s native image generation). - You want predictable per-request cost.
Seeing what Auto Router picked
Check theX-Orca-Resolved-Model response header. See
Response Headers.
