Pricing · updated 2026-05-17
Pay-as-you-go.
No markup, no subscription.
Fund a wallet via Stripe, requests decrement at provider list price. BYOK requests bypass the wallet entirely (you pay the provider direct). No monthly minimum, no per-seat fee, no tier upgrades.
Platform key
Provider list price
- We hold keys to Anthropic / OpenAI / Google / others
- You fund a Stripe wallet, requests decrement at list
- No markup on any model
- Cache reads at provider's cache price (~10% of input)
BYOK
Provider list + 0% surcharge
- Add your own provider keys to the workspace vault
- Provider bills you directly
- Surcharge currently waived (0%)
- Per-model whitelist, fallback toggle, audit log
$50 launch top-up
$50 → 10% off 30 days
- Fund $50, get extra 10% off platform-key requests
- 30-day discount window, balance never expires
- Stacks with normal usage
- Refundable within 7 days
Example prices
Per million tokens, USD, list price as of 2026-05-17. We pass these through 1:1. Full catalogue lives at /v1/models and in the console.
| Provider | Model | Input ($/Mtok) | Output ($/Mtok) |
|---|---|---|---|
| Anthropic | claude-sonnet-4-6 | $3.00 | $15.00 |
| OpenAI | gpt-5 | $2.50 | $10.00 |
gemini-2.5-pro | $1.25 | $5.00 | |
| DeepSeek | deepseek-v3 | $0.27 | $1.10 |
Prompt caching: cache reads ~10% of input price, cache writes ~125% of input price (Anthropic schedule). Cross-provider translation details in our prompt caching post.
FAQ
Is there a subscription or monthly minimum?
No. Pay-as-you-go. Fund the wallet once, requests decrement from it as you use them. Unused balance carries over indefinitely.
Is there markup on provider list prices?
No markup on platform-key requests — you pay exactly what we pay the provider. BYOK surcharge is currently 0% (gateway-internal routing fee only, currently waived).
How does prompt caching pricing work?
Cache reads cost ~10% of input price (Anthropic schedule). Cache writes (first call within TTL) cost ~125% of input. We pass these through directly. Cross-provider caching translation is documented in Prompt Caching Across LLM Providers.
What about refunds?
Unused balance refundable within 7 days, no questions asked. Stripe processes the refund directly. The crash-safe billing post covers how we make sure your balance stays correct even if requests die mid-flight.
Is the gateway open source?
Planned but not shipping yet. SDK clients and cookbook examples are open source; the gateway core is currently closed.
Start with whatever fits
Free signup, fund $1 if you want to try, top-up promo when you're ready.