Per-token API pricing for builders — OpenAI, Anthropic, Gemini, DeepSeek, xAI and Mistral, side by side at your real volume.
Drives our token-per-request estimates — you can override everything in step 2.
Estimated from your use case — override any number.
Grouped by tier — comparing a frontier model with a budget one on price alone is meaningless.
The cheapest model's cost vs the value of the human time it offsets — at your volume.
Want ROI for every model side by side? Open the full ROI calculator →
Costs marked ≈ use our auto-estimated tokens — override them in step 2 for exact math. Cached input is billed at each model's published cache-read rate; models without published cache pricing are computed at the full input rate. Models with a published long-context tier (e.g. Gemini above 200k input) switch to those rates automatically. Reasoning effort multiplies billed output to estimate thinking tokens.