Every Gemini model we track, at official per-token rates — verified by hand against Google's pricing page. Four models across all three tiers.
| Model | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|
| Gemini 3.1 Pro PreviewFRONTIER | $2 | $12 | $0.20 | — | $508 |
| Gemini 3.5 FlashMID | $1.50 | $9 | $0.15 | — | $381 |
| Gemini 3 Flash PreviewMID | $0.50 | $3 | $0.05 | — | $127 |
| Gemini 3.1 Flash-LiteBUDGET | $0.25 | $1.50 | $0.025 | — | $63.5 |
* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: no verified batch discount published for Gemini at our last revision — we only list discounts we've confirmed.
Cached input is billed at 10% of the input rate across all Gemini models we track — a major lever for chatbots and agents where most of the prompt repeats. The calculator models this with your cache share.
No verified batch discount published at our last revision — we only list discounts we've confirmed, so the Batch toggle in the calculator leaves Gemini prices unchanged.
We haven't verified official context-window figures for the Gemini line yet — they're listed as “—” until we do. Note the long-context surcharge: Gemini 3.1 Pro Preview bills prompts over 200k tokens at $4 in / $18 out per 1M — the calculator prices all prompts at the base rate.
Every change is verified by hand and published to the changelog — you get one email per confirmed change.
Price-change alerts only. No newsletter, unsubscribe anytime. Privacy