Every price here is taken by hand from the provider's official pricing page, verified on a dated pass, and turned into a monthly figure by the same engine as the calculator — never quoted from memory or a third party. Here is exactly where each of the 28 models' numbers come from, what they mean, and what we don't claim.
Four rules, the same for every model.
28 models across 6 providers. Last full verification: 2026-06-20.
| Provider | Models | Official pricing source | Last verified |
|---|---|---|---|
| OpenAI | 5 | platform.openai.com/docs/pricing | 2026-06-20 |
| Anthropic | 4 | platform.claude.com/docs/en/about-claude/pricing | 2026-06-20 |
| Google (Gemini) | 4 | ai.google.dev/gemini-api/docs/pricing | 2026-06-20 |
| DeepSeek | 2 | api-docs.deepseek.com/quick_start/pricing | 2026-06-20 |
| xAI (Grok) | 2 | docs.x.ai/docs/models | 2026-06-20 |
| Mistral | 11 | mistral.ai/pricing | 2026-06-20 |
Mistral's per-model lineup is taken from the official model cards on docs.mistral.ai; the public FAQ price page lists only the flagship.
A "token" is not a fixed unit. Each model family uses its own tokenizer, so the same text becomes a different number of tokens from one provider to the next — commonly a 10–30% spread (wider for non-English text and code). Two models at the same "$ per 1M tokens" can therefore cost differently for the same prompt, because one encodes it in fewer tokens.
Tokens aren't equal in value either: a token from a frontier model carries more capability than one from a budget model, so price per token says nothing about quality.
What this means for our numbers: we compare list price per 1M tokens as the billing unit — the honest, vendor-published basis — and run every model through one identical workload (same token counts) so the ≈ $/mo column is apples-to-apples on price. Treat cross-provider gaps inside ~10–20% as a tie until you test your own prompts; the calculator lets you plug in your real token mix.
| Field | Definition |
|---|---|
| $ input / 1M | List price per 1M input (prompt) tokens — standard tier, short context. |
| $ output / 1M | List price per 1M output (completion) tokens. |
| $ cached / 1M | Price per 1M cached / read input tokens. "—" = caching not offered or not yet verified (we then bill input at the full rate). |
| Batch | Multiplier for the provider's Batch API (×0.5 = 50% off). "—" = no batch tier, or the exact discount is unverified and we hold it empty rather than guess. |
| Context | Maximum context window, in tokens. |
| Tier | Editorial class (frontier / mid / budget) for ranking — our judgement, not a benchmark. The calculator never compares price across tiers. |
We would rather show a gap than a guess:
We track the 28 models we can verify against an official, first-party price. We would rather cover fewer models accurately than more on guesswork.
An automated job re-reads all 6 official price pages every morning and flags any change; a second job watches context windows, caching, batch and deprecations. Those flags are evidence — a human still confirms before a number changes. Confirmed changes are dated in the changelog, and we take no commission from any model provider, so nothing here is pay-to-rank.