Every model. Real prices, zero markup.
Transparent pay-as-you-go pricing across 7 providers · 37 models, USD per 1M tokens. Live from /api/pricing.
No models match your filters Try clearing the search or picking a different category.
OpenAI
10 modelsGPT family · frontier reasoning
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
gpt-5.4-nano
400K
$0.200
$1.25
$0.100
gpt-5.4-mini
400K
$0.750
$4.50
$0.375
gpt-5.2-codex
400K
$1.00
$4.00
$0.500
gpt-5.2
400K
$1.25
$5.00
$0.625
gpt-5.3-chat
400K
$1.50
$6.00
$0.750
gpt-5.3-codex
400K
$1.50
$6.00
$0.750
gpt-5.4
922K
$2.50
$15.00
$1.25
gpt-5.4-pro
922K
$5.00
$30.00
$2.50
gpt-5.5
922K
$5.00
$30.00
$0.500
gpt-5.5-pro
1.1M
$30.00
$180.00
—
Anthropic
8 modelsClaude · long context, reliable tools
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
claude-haiku-4-5
200K
$1.00
$5.00
$0.100
claude-sonnet-4-5
200K
$3.00
$15.00
$0.300
claude-sonnet-4-6
1M
$3.00
$15.00
$0.300
claude-opus-4-5
200K
$5.00
$25.00
$0.500
claude-opus-4-6
1M
$5.00
$25.00
$0.500
claude-opus-4-7
1M
$5.00
$25.00
$0.500
claude-opus-4-8
1M
$5.00
$25.00
$0.500
claude-fable-5
1M
$10.00
$50.00
$1.00
Gemini · multimodal, large context
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
gemini-2.5-flash-lite
1.0M
$0.100
$0.400
$0.010
gemini-3.1-flash-lite-preview
1.0M
$0.250
$1.50
—
gemini-2.5-flash
1.0M
$0.300
$2.50
$0.030
gemini-3-flash-preview
1.0M
$0.500
$3.00
$0.050
gemini-2.5-pro
1.0M
$1.25
$10.00
$0.125
gemini-3.5-flash
1.0M
$1.50
$9.00
$0.150
gemini-3.1-pro-preview
1.0M
$2.00
$12.00
$0.200
Alibaba
5 modelsQwen · cost-efficient, open weights
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
qwen3-vl-flash
256K
$0.050+
$0.400
$0.022
qwen3.5-flash
1.0M
$0.100
$0.400
$0.029
qwen3-vl-plus
256K
$0.200+
$1.60
$0.143
qwen3.5-plus
1.0M
$0.400+
$2.40
$0.115
qwen3-max
256K
$1.20+
$6.00
$0.359
DeepSeek
2 modelsDeepSeek · strong reasoning, open weights
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
deepseek-v4-flash
1M
$0.138
$0.275
$0.003
deepseek-v4-pro
1M
$0.435
$0.870
$0.004
Moonshot
1 modelKimi · long context
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
kimi-k2.5
262K
$0.574
$3.01
$0.115
ByteDance
4 modelsDoubao · high-throughput inference
Model · Capabilities
Context
Input/1M
Output/1M
Cache/1M
Dola-Seed-2.0-mini
262K
$0.100+
$0.400
$0.020
ByteDance-Seed-1.8
262K
$0.250+
$2.00
$0.050
Dola-Seed-2.0-lite
262K
$0.250+
$2.00
$0.050
Dola-Seed-2.0-pro
262K
$0.500+
$3.00
$0.100
Pricing notes
- All prices are in USD. Billing is based on actual token usage.
- Input tokens = your prompt. Output tokens = the model’s response.
- Tiered pricing: a request’s entire token count is charged at the rate of the tier its input falls into. Larger inputs cross over to a higher tier.
- Cache hit: when the same prompt prefix is reused (context caching), input tokens are billed at the discounted Cache/1M rate shown above.
- Image and audio generation models are billed per API call.
- Volume discounts available for enterprise customers — contact us.