
Gemini 3.0 Flash Cost Calculator

Gemini 3.0 Flash balances speed and capability with a 1M-token context window. It is cheaper than 2.5 Flash and posts stronger benchmark results.

Input: $0.25 per 1M tokens
Output: $2.00 per 1M tokens
Context window: 1000K tokens
Released: 2026-03
Cutoff: 2026-01
Total cost per call (1000 input + 500 output tokens): $0.001250
Input: $0.000250
Output: $0.001000
Cost comparison
Standard: $0.001250
With Caching: $0.001156 (save 8%)
With Batch: $0.000625 (save 50%)
Detailed pricing

Gemini 3.0 Flash pricing breakdown

All pricing dimensions including caching and batch discounts.

Type            Price (per 1M tokens)
Input           $0.2500
Output          $2.0000
Cached input    $0.0625
Batch input     $0.1250
Batch output    $1.0000

Last verified 2026-05-01 · Google official pricing
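
The per-call figures on this page can be reproduced from the rates in the table above. The following is a minimal Python sketch, not the calculator's actual implementation. The function names are hypothetical, and `cached_fraction=0.5` is an assumption: the page does not state how much of the input the calculator treats as cached, but a 50% share happens to reproduce the $0.001156 "With Caching" figure.

```python
# Prices in USD per 1M tokens, copied from the pricing table on this page.
PRICES = {
    "input": 0.25,
    "output": 2.00,
    "cached_input": 0.0625,
    "batch_input": 0.125,
    "batch_output": 1.00,
}
PER_MILLION = 1_000_000


def standard_cost(input_tokens, output_tokens):
    """Cost of one call at the standard rates."""
    total = input_tokens * PRICES["input"] + output_tokens * PRICES["output"]
    return total / PER_MILLION


def cached_cost(input_tokens, output_tokens, cached_fraction=0.5):
    """Cost of one call when part of the input is billed at the cached rate.

    cached_fraction is an assumption, not a documented calculator setting.
    """
    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    total = (
        fresh * PRICES["input"]
        + cached * PRICES["cached_input"]
        + output_tokens * PRICES["output"]
    )
    return total / PER_MILLION


def batch_cost(input_tokens, output_tokens):
    """Cost of one call at the batch rates."""
    total = (
        input_tokens * PRICES["batch_input"]
        + output_tokens * PRICES["batch_output"]
    )
    return total / PER_MILLION


if __name__ == "__main__":
    for label, fn in [("Standard", standard_cost),
                      ("With Caching", cached_cost),
                      ("With Batch", batch_cost)]:
        print(f"{label}: ${fn(1000, 500):.6f}")
```

With 1000 input and 500 output tokens, the three functions return 0.00125, 0.00115625, and 0.000625 USD respectively, matching the Cost comparison figures after rounding.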

How it compares

Gemini 3.0 Flash vs alternatives

Single-call cost (1000 input + 500 output tokens). Gemini 3.0 Flash is listed first; the alternatives follow, ranked from cheapest.

Model               Provider              Per call
Gemini 3.0 Flash    Google (this page)    $0.001250
GPT-5 mini          OpenAI                $0.000600
DeepSeek V4         DeepSeek              $0.000900
o4-mini             OpenAI                $0.002700
Claude Haiku 4.5    Anthropic             $0.003500
Mistral Large 3     Mistral               $0.006250
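
To see where Gemini 3.0 Flash falls in a strict cheapest-first ordering, the per-call figures listed above can be sorted and expressed relative to it. This is an illustrative Python sketch; the helper names `rank_by_cost` and `relative_to` are hypothetical, and the numbers are simply the per-call costs listed on this page.

```python
# Per-call cost in USD (1000 input + 500 output tokens), as listed on this page.
PER_CALL_USD = {
    "Gemini 3.0 Flash": 0.001250,
    "GPT-5 mini": 0.000600,
    "DeepSeek V4": 0.000900,
    "o4-mini": 0.002700,
    "Claude Haiku 4.5": 0.003500,
    "Mistral Large 3": 0.006250,
}


def rank_by_cost(costs):
    """Return (model, cost) pairs sorted from cheapest to most expensive."""
    return sorted(costs.items(), key=lambda item: item[1])


def relative_to(costs, baseline):
    """Return each model's cost as a multiple of the baseline model's cost."""
    base = costs[baseline]
    return {name: cost / base for name, cost in costs.items()}


if __name__ == "__main__":
    ratios = relative_to(PER_CALL_USD, "Gemini 3.0 Flash")
    for name, cost in rank_by_cost(PER_CALL_USD):
        print(f"{name:<18} ${cost:.6f}  ({ratios[name]:.2f}x)")
```

On these figures, Gemini 3.0 Flash ranks third of six: two listed models are cheaper per call and three are more expensive.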
Recommended use

When to choose Gemini 3.0 Flash

Gemini 3.0 Flash suits general-purpose tasks, image and document understanding, high-throughput and low-latency workloads, and audio understanding and generation. Token counts on this page are estimates, accurate to within roughly 10-20%.

Context window of 1000K tokens handles entire codebases or book-length documents.
Prompt caching available: cached input costs $0.0625 per 1M tokens, 75% below the standard input rate, which helps with repeated system prompts.
Batch API support for non-realtime workloads at a ~50% discount.
Tool / function calling supported.
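
Because token counts here are estimates with a ~10-20% margin, a per-call cost is better read as a range than as a single number. The Python sketch below shows one way to do that. It assumes the margin is symmetric and applies equally to input and output tokens, which the page does not specify, and `cost_range` is a hypothetical helper name.

```python
# Standard rates in USD per 1M tokens, from this page.
INPUT_USD_PER_M = 0.25
OUTPUT_USD_PER_M = 2.00


def cost_range(input_tokens, output_tokens, margin=0.20):
    """Return (low, point, high) cost estimates in USD for one call.

    margin is the assumed relative error of the token estimate (0.20 = 20%),
    applied symmetrically to both input and output token counts.
    """
    point = (
        input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    ) / 1_000_000
    # Cost is linear in token count, so the margin scales the total directly.
    return point * (1 - margin), point, point * (1 + margin)


if __name__ == "__main__":
    low, point, high = cost_range(1000, 500)
    print(f"${low:.6f} to ${high:.6f} (point estimate ${point:.6f})")
```

For the 1000 input + 500 output example, a 20% margin gives a range of $0.001000 to $0.001500 around the $0.001250 point estimate.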
FAQ

Frequently asked questions

Gemini 3.0 Flash costs $0.25 per 1M input tokens and $2.00 per 1M output tokens. A typical chat call (1000 input + 500 output tokens) costs approximately $0.0013. Use the calculator above to estimate your specific use case.