Every DeepSeek model we track, at official per-token rates — verified by hand against DeepSeek's pricing page. Two models across the mid and budget tiers.
| Model | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|
| DeepSeek V4 ProMID | $0.435 | $0.87 | $0.003625 | — | $52.71 |
| DeepSeek V4 FlashBUDGET | $0.14 | $0.28 | $0.0028 | — | $17.19 |
* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: no verified batch discount published for DeepSeek at our last revision — we only list discounts we've confirmed.
DeepSeek publishes the cheapest cache reads we track: DeepSeek V4 Flash at $0.0028/1M (2% of input); DeepSeek V4 Pro at $0.003625/1M (0.8% of input). The calculator models this with your cache share.
No verified batch discount published at our last revision — we only list discounts we've confirmed, so the Batch toggle in the calculator leaves DeepSeek prices unchanged.
DeepSeek V4 Pro and DeepSeek V4 Flash run a verified 1M-token context window.
Every change is verified by hand and published to the changelog — you get one email per confirmed change.
Price-change alerts only. No newsletter, unsubscribe anytime. Privacy