
Google Gemini API Pricing

Every Gemini model we track, at official per-token rates — verified by hand against Google's pricing page. Four models across all three tiers.

Prices verified June 2026 · changes logged in the changelog
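| Model | Tier | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|---|
| Gemini 3.1 Pro Preview | FRONTIER | $2 | $12 | $0.20 | n/a | $508 |
| Gemini 3.5 Flash | MID | $1.50 | $9 | $0.15 | n/a | $381 |
| Gemini 3 Flash Preview | MID | $0.50 | $3 | $0.05 | n/a | $127 |
| Gemini 3.1 Flash-Lite | BUDGET | $0.25 | $1.50 | $0.025 | n/a | $63.50 |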

* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: no verified batch discount published for Gemini at our last revision — we only list discounts we've confirmed.
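The ≈ $/mo figures can be reproduced by hand. The sketch below is not the calculator's actual engine; it is a minimal Python version of the same arithmetic, assuming the cached share applies to input tokens only and that every request has the same shape. The names `Rates`, `RATES` and `monthly_cost` are made up for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Prices in USD per 1M tokens, copied from the table above."""
    input: float
    output: float
    cached: float


RATES = {
    "Gemini 3.1 Pro Preview": Rates(2.00, 12.00, 0.20),
    "Gemini 3.5 Flash": Rates(1.50, 9.00, 0.15),
    "Gemini 3 Flash Preview": Rates(0.50, 3.00, 0.05),
    "Gemini 3.1 Flash-Lite": Rates(0.25, 1.50, 0.025),
}


def monthly_cost(rates, requests=100_000, input_tokens=2_000,
                 output_tokens=300, cache_share=0.70):
    """Monthly USD cost for a uniform workload (defaults = the example chatbot)."""
    total_in = requests * input_tokens      # 200M input tokens per month
    cached_in = total_in * cache_share      # 140M billed at the cached rate
    fresh_in = total_in - cached_in         # 60M billed at the full input rate
    total_out = requests * output_tokens    # 30M output tokens per month
    usd = (fresh_in * rates.input
           + cached_in * rates.cached
           + total_out * rates.output)
    return usd / 1_000_000                  # rates are quoted per 1M tokens


if __name__ == "__main__":
    for name, rates in RATES.items():
        print(f"{name}: ${monthly_cost(rates):,.2f}")
```

For Gemini 3.1 Pro Preview this works out to 60M × $2 + 140M × $0.20 + 30M × $12 per 1M tokens, i.e. $120 + $28 + $360 = $508, matching the table.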

Prompt caching

Cached input is billed at 10% of the input rate across all Gemini models we track — a major lever for chatbots and agents where most of the prompt repeats. The calculator models this with your cache share.
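To see how much the cache share moves the effective input price, the 10% rule can be written as a blended rate. This is an illustrative sketch only: `blended_input_rate` is a hypothetical helper, and it assumes cache hits are billed at exactly 10% of the input rate with no separate storage or write charge.

```python
def blended_input_rate(input_rate, cache_share, cached_fraction=0.10):
    """Effective USD per 1M input tokens when part of the prompt hits the cache.

    input_rate      -- list price per 1M input tokens
    cache_share     -- fraction of input tokens served from cache (0.0 to 1.0)
    cached_fraction -- cached price as a fraction of the input rate
                       (0.10 per the rule above; an assumption elsewhere)
    """
    if not 0.0 <= cache_share <= 1.0:
        raise ValueError("cache_share must be between 0 and 1")
    cached_rate = input_rate * cached_fraction
    return (1 - cache_share) * input_rate + cache_share * cached_rate


if __name__ == "__main__":
    # Gemini 3.1 Pro Preview ($2 per 1M input) at a 70% cache share.
    rate = blended_input_rate(2.00, 0.70)
    print(f"blended input rate: ${rate:.2f} per 1M tokens")
```

At a 70% cache share the $2 input rate blends down to $0.74 per 1M tokens (0.3 × $2 + 0.7 × $0.20), a 63% cut in input spend.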

Batch / async

No verified batch discount published at our last revision — we only list discounts we've confirmed, so the Batch toggle in the calculator leaves Gemini prices unchanged.

Context window

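We haven't verified official context-window figures for the Gemini line yet, so they're listed as “—” until we do. Note the long-context surcharge: Gemini 3.1 Pro Preview bills prompts over 200k tokens at $4 in / $18 out per 1M, which is double the base input rate and 1.5× the base output rate. The calculator does not model this surcharge and prices all prompts at the base rate, so its estimates run low for workloads with prompts above 200k tokens.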
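If your prompts can cross the 200k mark, you can apply the surcharge yourself. The sketch below rests on two assumptions: that the whole request (input and output) is billed at the higher rates once the prompt exceeds 200k tokens, and that caching is ignored. The function and constant names are hypothetical.

```python
LONG_CONTEXT_THRESHOLD = 200_000   # prompt tokens; above this the surcharge applies
BASE_RATES = (2.00, 12.00)         # USD per 1M tokens (input, output)
LONG_RATES = (4.00, 18.00)         # USD per 1M tokens for prompts over 200k


def pro_preview_request_cost(prompt_tokens, output_tokens):
    """USD cost of one Gemini 3.1 Pro Preview request, without caching.

    Assumption: the entire request moves to the long-context rates once the
    prompt is over the threshold, not only the tokens beyond it.
    """
    if prompt_tokens > LONG_CONTEXT_THRESHOLD:
        in_rate, out_rate = LONG_RATES
    else:
        in_rate, out_rate = BASE_RATES
    return (prompt_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    for prompt in (150_000, 250_000):
        cost = pro_preview_request_cost(prompt, 1_000)
        print(f"{prompt:,} prompt tokens: ${cost:.3f}")
```

Under these assumptions, a 250k-token prompt with 1,000 output tokens costs $1.018 instead of the $0.512 that base-rate pricing would give.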

Is Gemini the right price for your workload?
The calculator puts these four models next to the other 14 we track — at your volume, token mix and cache share.
