How to use
Put an ordered list of model IDs inextra_body.models and set
extra_body.route to "fallback". The primary model field still
matters — it’s the first attempt — but OrcaRouter ignores it in favor of
the chain if the chain is present.
Rules
- Maximum 5 models in the chain.
- Models must be the same kind — a chat fallback chain can’t drop into an image-generation model. An image fallback chain must be all image models. We don’t translate across endpoint types.
- If a model in the chain doesn’t resolve (wrong name, no channel), we silently skip it and try the next one. The request only fails if all chain entries fail.
- Billing happens for the model that actually served the response, at that model’s rate.
How to tell which model served the response
Check theX-Orca-Fallback-Level and X-Orca-Fallback-Model response
headers. See Response Headers.
When not to use this
If you want OrcaRouter to automatically pick the cheapest available model without writing a chain, useorcarouter/auto
instead. Fallback chains are for cases where you want explicit control
over the ordering.