Where do your LLM prices come from?

Each per-token rate is taken by hand from the provider's official pricing page (listed in the sources table) and confirmed by a person before it enters our data. The monthly figure is computed by our engine from those rates, not quoted from a vendor.

Are token counts the same across providers?

No. Every model family uses its own tokenizer, so the same text becomes a different number of tokens, typically a 10–30% spread and wider for non-English text and code. We compare list price per 1M tokens as the billing unit and run one identical workload through every model, so treat cross-provider differences within roughly 10–20% as a tie until you test your own prompts.

Do the model providers pay you?

No. The LLM section carries no affiliate links and no sponsored placement — rankings are by objective cost only. It exists to be an accurate, independent reference.

How often are prices checked?

An automated job re-reads every official price page daily and flags changes; a human confirms before any number moves. The last full hand-verification was 2026-06-20, and every change is dated in the changelog.

How We Verify LLM API Pricing — Methodology & Sources

How we price LLM APIs

Four rules, the same for every model.

One source of truth per provider. Every per-token rate comes from the vendor's own official pricing page (listed below) — not a blog, aggregator or screenshot.
Verified by hand, then promoted. A scrape or research pass is only evidence; a number reaches our data file only after a human confirms it against the official page. Raw dumps are kept dated for audit.
Computed, not quoted. The "≈ $/mo" you see is calculated by our engine from input, output and cached rates at a stated workload — so models compare on one number. We never invent a monthly price.
Every change is logged. Prices are committed before the site rebuilds, so each change is dated in the public changelog with the provider it came from.
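
The "computed, not quoted" rule can be sketched roughly as follows. This is a minimal illustration, not the site's actual engine: the function name, parameter names, rates and workload are all hypothetical, and the fallback to the full input rate when no cached rate is known follows the field definitions given later on this page.

```python
from typing import Optional


def monthly_cost(
    input_tokens: float,
    output_tokens: float,
    input_rate: float,
    output_rate: float,
    cached_rate: Optional[float] = None,
    cache_share: float = 0.0,
) -> float:
    """Return USD per month.

    Token counts are monthly totals; rates are USD per 1M tokens.
    cache_share is the fraction of input tokens served from cache.
    """
    if not 0.0 <= cache_share <= 1.0:
        raise ValueError("cache_share must be between 0 and 1")
    # No verified cached rate: bill cached reads at the full input rate.
    if cached_rate is None:
        cached_rate = input_rate
    cached_tokens = input_tokens * cache_share
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * input_rate
        + cached_tokens * cached_rate
        + output_tokens * output_rate
    )
    return total / 1_000_000


# Hypothetical rates and workload, not real vendor prices.
example = monthly_cost(
    input_tokens=100_000_000,
    output_tokens=20_000_000,
    input_rate=2.00,
    output_rate=8.00,
    cached_rate=0.50,
    cache_share=0.25,
)
print(f"{example:.2f} USD/mo")  # 322.50 USD/mo
```

Because every model is run through the same token counts, the only inputs that vary between models are the three rates, which is what makes the monthly figure comparable.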

Sources — every number traces to an official page

28 models across 6 providers. Last full verification: 2026-06-20.

Provider	Models	Official pricing source	Last verified
OpenAI	5	platform.openai.com/docs/pricing	2026-06-20
Anthropic	4	platform.claude.com/docs/en/about-claude/pricing	2026-06-20
Google (Gemini)	4	ai.google.dev/gemini-api/docs/pricing	2026-06-20
DeepSeek	2	api-docs.deepseek.com/quick_start/pricing	2026-06-20
xAI (Grok)	2	docs.x.ai/docs/models	2026-06-20
Mistral	11	mistral.ai/pricing	2026-06-20

Mistral's per-model lineup is taken from the official model cards on docs.mistral.ai; the public FAQ price page lists only the flagship.

Are tokens the same across models? No — and it matters

A "token" is not a fixed unit. Each model family uses its own tokenizer, so the same text becomes a different number of tokens from one provider to the next — commonly a 10–30% spread (wider for non-English text and code). Two models at the same "$ per 1M tokens" can therefore cost differently for the same prompt, because one encodes it in fewer tokens.
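
A small worked example may make the effect concrete. The token counts and the rate below are invented for illustration: two hypothetical models share the same list price, but the second tokenizer splits the same text into 20% more tokens.

```python
def prompt_cost(token_count: int, rate_per_million: float) -> float:
    """Cost in USD of one prompt at a given USD-per-1M-token rate."""
    return token_count * rate_per_million / 1_000_000


RATE = 3.00        # hypothetical: both models list $3.00 per 1M input tokens
TOKENS_A = 1_000   # hypothetical: model A encodes the text in 1,000 tokens
TOKENS_B = 1_200   # hypothetical: model B needs 1,200 tokens for the same text

cost_a = prompt_cost(TOKENS_A, RATE)
cost_b = prompt_cost(TOKENS_B, RATE)
extra = cost_b / cost_a - 1

print(f"Model B costs {extra:.0%} more for the same text")
```

The list price is identical, yet the bill differs by the same ratio as the token counts.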

Tokens aren't equal in value either: a token from a frontier model carries more capability than one from a budget model, so price per token says nothing about quality.

What this means for our numbers: we compare list price per 1M tokens as the billing unit — the honest, vendor-published basis — and run every model through one identical workload (same token counts) so the ≈ $/mo column is apples-to-apples on price. Treat cross-provider gaps inside ~10–20% as a tie until you test your own prompts; the calculator lets you plug in your real token mix.
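
One way to apply the tie rule to two monthly figures is sketched below. The helper names are hypothetical, and measuring the gap relative to the cheaper model is an assumption; the page does not define the gap formula.

```python
def relative_gap(cost_a: float, cost_b: float) -> float:
    """Gap between two costs as a fraction of the cheaper one (assumed definition)."""
    low, high = sorted((cost_a, cost_b))
    if low <= 0:
        raise ValueError("costs must be positive")
    return (high - low) / low


def is_tie(cost_a: float, cost_b: float, tolerance: float = 0.10) -> bool:
    """True when the gap is small enough to be tokenizer noise."""
    return relative_gap(cost_a, cost_b) <= tolerance


# Hypothetical monthly costs for two models on the same workload.
print(is_tie(322.50, 360.00, tolerance=0.10))  # False: the gap is about 11.6%
print(is_tie(322.50, 360.00, tolerance=0.20))  # True
```

Picking a tolerance anywhere in the 10–20% band is a judgement call; a stricter value suits English prose, a looser one suits code or non-English text where tokenizers diverge more.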

What each number means

Field	Definition
$ input / 1M	List price per 1M input (prompt) tokens — standard tier, short context.
$ output / 1M	List price per 1M output (completion) tokens.
$ cached / 1M	Price per 1M cached / read input tokens. "—" = caching not offered or not yet verified (we then bill input at the full rate).
Batch	Multiplier for the provider's Batch API (×0.5 = 50% off). "—" = no batch tier, or the exact discount is unverified and we hold it empty rather than guess.
Context	Maximum context window, in tokens.
Tier	Editorial class (frontier / mid / budget) for ranking — our judgement, not a benchmark. The calculator never compares price across tiers.
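
As a rough sketch, one row of this table could be modelled as the record below. The class, field and function names are hypothetical and are not taken from the site's data file; `None` stands in for the "—" shown when a value is not offered or not verified.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ModelPrice:
    name: str
    input_per_m: float                  # $ input / 1M, standard tier, short context
    output_per_m: float                 # $ output / 1M
    cached_per_m: Optional[float]       # None = not offered or not yet verified
    batch_multiplier: Optional[float]   # 0.5 = 50% off; None = no tier or unverified
    context_tokens: int                 # maximum context window
    tier: str                           # editorial class: "frontier", "mid" or "budget"


def effective_cached_rate(model: ModelPrice) -> float:
    """Fall back to the full input rate when no cached rate is known."""
    if model.cached_per_m is None:
        return model.input_per_m
    return model.cached_per_m


def batch_rates(model: ModelPrice) -> Optional[Tuple[float, float]]:
    """Return (input, output) batch rates, or None rather than guess a discount."""
    if model.batch_multiplier is None:
        return None
    return (
        model.input_per_m * model.batch_multiplier,
        model.output_per_m * model.batch_multiplier,
    )


# Hypothetical model with invented prices.
demo = ModelPrice("hypothetical-model", 2.00, 8.00, None, 0.5, 128_000, "mid")
print(effective_cached_rate(demo))  # 2.0
print(batch_rates(demo))            # (1.0, 4.0)
```

Keeping unverified values as `None` instead of a placeholder number means a missing discount can never silently lower a computed cost.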

Known limits & confidence

We would rather show a gap than a guess:

Mistral cached prices are held empty (except the flagship): Mistral documents a caching rule but doesn't publish per-model cached rates, so we bill input at the full rate rather than estimate.
Batch multipliers for xAI and DeepSeek are held empty: a batch or cache discount exists, but the exact figure isn't officially tabled, so we don't apply one.
Long-context surcharges (e.g. Gemini above 200k tokens) are noted in our data but not yet in the calculation — short-context pricing is shown.
Tier and our example workload are assumptions, not vendor facts — change them in the calculator for your case.

What we exclude, and why

Cohere — the current lineup is sold as enterprise instances with no public per-token price, so it can't be verified the same way.
Meta Llama — priced only through third-party hosts with their own rates; planned for a later pass once we can compare hosts fairly.

We track the 28 models we can verify against an official, first-party price. We would rather cover fewer models accurately than more on guesswork.

How we keep it current

An automated job re-reads all 6 official price pages every morning and flags any change; a second job watches context windows, caching, batch and deprecations. Those flags are evidence — a human still confirms before a number changes. Confirmed changes are dated in the changelog, and we take no commission from any model provider, so nothing here is pay-to-rank.
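
The flag-then-confirm flow might look something like this. It is a sketch under assumptions: the data shapes, function names, model name and date are all invented, and the real jobs are not described in enough detail here to reproduce.

```python
from datetime import date
from typing import Dict, List, NamedTuple, Optional, Set

# model name -> field name -> value (None = held empty)
Prices = Dict[str, Dict[str, Optional[float]]]


class Flag(NamedTuple):
    model: str
    field: str
    old: Optional[float]
    new: Optional[float]


def flag_changes(confirmed: Prices, scraped: Prices) -> List[Flag]:
    """Compare a fresh scrape with confirmed data. Flags are evidence only."""
    flags = []
    for model, fields in scraped.items():
        known = confirmed.get(model, {})
        for field, new_value in fields.items():
            old_value = known.get(field)
            if old_value != new_value:
                flags.append(Flag(model, field, old_value, new_value))
    return flags


def apply_confirmed(
    confirmed: Prices, flags: List[Flag], approved: Set[Flag], on: date
) -> List[str]:
    """Apply only flags a human approved; return dated changelog lines."""
    changelog = []
    for flag in flags:
        if flag not in approved:
            continue  # unconfirmed flags never touch the data
        confirmed.setdefault(flag.model, {})[flag.field] = flag.new
        changelog.append(
            f"{on.isoformat()} {flag.model} {flag.field}: {flag.old} -> {flag.new}"
        )
    return changelog


# Hypothetical data: the scrape sees a lower input price.
confirmed = {"hypothetical-model": {"input": 2.0, "output": 8.0}}
scraped = {"hypothetical-model": {"input": 1.5, "output": 8.0}}

flags = flag_changes(confirmed, scraped)
untouched = apply_confirmed(confirmed, flags, approved=set(), on=date(2026, 1, 1))
logged = apply_confirmed(confirmed, flags, approved=set(flags), on=date(2026, 1, 1))
print(logged)
```

With an empty approval set the confirmed data stays unchanged, which mirrors the rule that a flag alone never moves a number.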

See it on your own numbers

The calculator uses these exact prices — plug in your volume, token mix and cache share, and it re-ranks every model live.

