LLM / methodology

How we verify LLM API pricing

Every price here is taken by hand from the provider's official pricing page, verified on a dated pass, and turned into a monthly figure by the same engine as the calculator — never quoted from memory or a third party. Here is exactly where each of the 28 models' numbers come from, what they mean, and what we don't claim.

Prices verified June 2026 · changes logged in the changelog

How we price LLM APIs

Four rules, the same for every model.

  1. One source of truth per provider. Every per-token rate comes from the vendor's own official pricing page (listed below) — not a blog, aggregator or screenshot.
  2. Verified by hand, then promoted. A scrape or research pass is only evidence; a number reaches our data file only after a human confirms it against the official page. Raw dumps are kept dated for audit.
  3. Computed, not quoted. The "≈ $/mo" you see is calculated by our engine from input, output and cached rates at a stated workload — so models compare on one number. We never invent a monthly price.
  4. Every change is logged. Prices are committed before the site rebuilds, so each change is dated in the public changelog with the provider it came from.

Sources — every number traces to an official page

28 models across 6 providers. Last full verification: 2026-06-20.

ProviderModelsOfficial pricing sourceLast verified
OpenAI 5 platform.openai.com/docs/pricing 2026-06-20
Anthropic 4 platform.claude.com/docs/en/about-claude/pricing 2026-06-20
Google (Gemini) 4 ai.google.dev/gemini-api/docs/pricing 2026-06-20
DeepSeek 2 api-docs.deepseek.com/quick_start/pricing 2026-06-20
xAI (Grok) 2 docs.x.ai/docs/models 2026-06-20
Mistral 11 mistral.ai/pricing 2026-06-20

Mistral's per-model lineup is taken from the official model cards on docs.mistral.ai; the public FAQ price page lists only the flagship.

Are tokens the same across models? No — and it matters

A "token" is not a fixed unit. Each model family uses its own tokenizer, so the same text becomes a different number of tokens from one provider to the next — commonly a 10–30% spread (wider for non-English text and code). Two models at the same "$ per 1M tokens" can therefore cost differently for the same prompt, because one encodes it in fewer tokens.

Tokens aren't equal in value either: a token from a frontier model carries more capability than one from a budget model, so price per token says nothing about quality.

What this means for our numbers: we compare list price per 1M tokens as the billing unit — the honest, vendor-published basis — and run every model through one identical workload (same token counts) so the ≈ $/mo column is apples-to-apples on price. Treat cross-provider gaps inside ~10–20% as a tie until you test your own prompts; the calculator lets you plug in your real token mix.

What each number means

FieldDefinition
$ input / 1MList price per 1M input (prompt) tokens — standard tier, short context.
$ output / 1MList price per 1M output (completion) tokens.
$ cached / 1MPrice per 1M cached / read input tokens. "—" = caching not offered or not yet verified (we then bill input at the full rate).
BatchMultiplier for the provider's Batch API (×0.5 = 50% off). "—" = no batch tier, or the exact discount is unverified and we hold it empty rather than guess.
ContextMaximum context window, in tokens.
TierEditorial class (frontier / mid / budget) for ranking — our judgement, not a benchmark. The calculator never compares price across tiers.

Known limits & confidence

We would rather show a gap than a guess:

What we exclude, and why

We track the 28 models we can verify against an official, first-party price. We would rather cover fewer models accurately than more on guesswork.

How we keep it current

An automated job re-reads all 6 official price pages every morning and flags any change; a second job watches context windows, caching, batch and deprecations. Those flags are evidence — a human still confirms before a number changes. Confirmed changes are dated in the changelog, and we take no commission from any model provider, so nothing here is pay-to-rank.

See it on your own numbers
The calculator uses these exact prices — plug in your volume, token mix and cache share, and it re-ranks every model live.
Open calculator
All 28 models → OpenAI pricing → Anthropic pricing → Gemini pricing → DeepSeek pricing → Grok pricing → Mistral pricing → Cheapest LLM API → Open calculator →

Frequently asked questions

Each per-token rate is taken by hand from the provider's official pricing page (listed in the sources table above) and confirmed by a person before it enters our data. The monthly figure is computed by our engine, not quoted.
No. Every model family uses its own tokenizer, so the same text becomes a different number of tokens — typically a 10–30% spread, wider for non-English text and code. We compare list price per 1M tokens as the billing unit and run one identical workload through every model, so treat small cross-provider differences as a tie until you test your own prompts.
No. The LLM section carries no affiliate links and no sponsored placement — rankings are by objective cost only. It exists to be an accurate, independent reference.
An automated job re-reads every official price page daily and flags changes; a human confirms before any number moves. The last full hand-verification was 2026-06-20, and every change is dated in the changelog.