Create a chat completion

Authorizations

Authorization

string

header

required

OrcaRouter API keys look like sk-orca-.... Pass them in the Authorization: Bearer sk-orca-... header.

Body

application/json

model

string

required

Model ID. Supports three forms:

Provider-prefixed (default): openai/gpt-4o-mini, anthropic/claude-sonnet-4.6, google/gemini-2.5-flash
Plain alias: gpt-4o-mini (when a bare-name alias is available)
Named router: orcarouter/{name} (resolves to a model at request time; orcarouter/auto is seeded on signup for every account and picks the cheapest live chat model)

Examples:

"gpt-4o"

"openai/gpt-4o"

"orcarouter/auto"

messages

object[]

required

Show child attributes

stream

boolean

When true, response is streamed as server-sent events.

stream_options

object

Only applies when stream: true.

Show child attributes

tools

object[]

Show child attributes

tool_choice

Available options:

auto,

none,

required

parallel_tool_calls

boolean

default:true

response_format

Text (default) · object

Text (default)
JSON mode
JSON Schema

Show child attributes

temperature

number

Required range: 0 <= x <= 2

top_p

number

Required range: 0 <= x <= 1

max_tokens

integer

Required range: x >= 1

max_completion_tokens

integer

Preferred over max_tokens for reasoning models.

integer

default:1

Required range: x >= 1

stop

seed

integer

For deterministic sampling.

logprobs

boolean

top_logprobs

integer

Required range: 0 <= x <= 20

presence_penalty

number

Required range: -2 <= x <= 2

frequency_penalty

number

Required range: -2 <= x <= 2

logit_bias

object

Show child attributes

user

string

reasoning_effort

enum<string>

For OpenAI reasoning models (o1, o3*, o4*, gpt-5*-pro, etc.). Anthropic Claude uses the thinking field instead; Gemini uses provider-specific configuration.

Available options:

low,

medium,

high

web_search_options

object

Enable web search on a Chat Completions request. The Responses API uses tools: [{"type": "web_search"}] instead. Honored by OpenAI search-preview models, OpenAI models that accept the modern web_search tool, and Anthropic models (translated to Anthropic's native web_search server-tool).

Show child attributes

web_search

any

Free-form raw payload forwarded to the upstream's web-search tool when web_search_options is not expressive enough. Most users should prefer web_search_options.

extra_body

object

OrcaRouter-specific request extensions. Place these under the extra_body top-level key of your chat completion request.

Show child attributes

Response

Successful completion. Streaming responses use SSE (text/event-stream).

string

object

enum<string>

Available options:

chat.completion

created

integer

model

string

choices

object[]

Show child attributes

usage

object

Show child attributes