Skip to main content

What you pay

OrcaRouter charges you the upstream provider’s published per-token price, with no per-token markup. Revenue comes from optional paid subscription plans, not from inflating your token cost.

The usage object

Every chat/responses response includes a usage field:
{
  "usage": {
    "prompt_tokens": 12,
    "completion_tokens": 34,
    "total_tokens": 46
  }
}
Reasoning models add completion_tokens_details.reasoning_tokens for the hidden reasoning pass.

Built-in tool charges

When the response uses a built-in / server-side tool, the call is counted and billed on top of token usage. OrcaRouter passes the upstream provider’s per-call rate through with no markup; refer to each provider’s pricing page for current rates. Tools that trigger per-call charges across providers:
  • OpenAIweb_search, web_search_preview, image_generation (Responses API built-in tools)
  • Anthropicweb_search (server tool)
  • Google GeminigoogleSearch grounding (per grounded prompt on Gemini 2.x family; per query on Gemini 3.x family)
  • xAI Grokweb_search, x_search, code_interpreter (Agent Tools API on /v1/responses)
Specific OpenAI rates currently passed through:
ToolPer-call rate
web_search$10 per 1,000 calls (unified across models)
web_search_preview10/1kforreasoningmodels(o3,o4,gpt5);10 / 1k for reasoning models (`o3*`, `o4*`, `gpt-5*`); 25 / 1k for non-reasoning models (gpt-4o*, gpt-4.1*, etc.)
For other providers’ server-tool rates, see each provider’s official pricing page.

OrcaRouter response headers

HeaderWhen set
X-Orca-Fallback-LevelIndex of the fallback model that served the response
X-Orca-Fallback-ModelName of the fallback model
X-Orca-RouterName of the orcarouter/{name} router used
X-Orca-Resolved-ModelConcrete model a named router resolved to
See Routing / Response Headers for use cases.

Looking up your usage

The Dashboard at orcarouter.ai/console shows daily spend, total spend over a chosen window, and a per-model breakdown. The Dashboard does not currently break spend down per API key — costs are aggregated at the workspace level. Two OpenAI-shape billing endpoints are also exposed for programmatic access:
# Total usage so far (returns an OpenAIUsageResponse with TotalUsage)
curl https://api.orcarouter.ai/v1/dashboard/billing/usage \
  -H "Authorization: Bearer sk-orca-..."

# Remaining quota and expiry (returns OpenAISubscriptionResponse)
curl https://api.orcarouter.ai/v1/dashboard/billing/subscription \
  -H "Authorization: Bearer sk-orca-..."
These are summary endpoints — they don’t include OpenAI’s full historical day-by-day breakdown.