OpenAI pricing
GPT-5 mini Cost Calculator
GPT-5 mini balances frontier-tier capability with low cost — 25x cheaper than GPT-5.5, ideal for high-volume production.
Input
$0.20
per 1M tokens
Output
$0.80
per 1M tokens
Context window
200K
tokens
Released
2025-08
Cutoff 2025-04
✓ Exact tokenizer·$0.20 in·$0.80 out (per 1M)
Quick start with a use case
Total cost per call$0.000600
Input$0.000200
Output$0.000400
Cost comparison
Standard
$0.000600
With Caching
$0.000525
Save 12% ↓
With Batch
$0.000300
Save 50% ↓
Detailed pricing
GPT-5 mini pricing breakdown
All pricing dimensions including caching and batch discounts.
| Type | Price (per 1M tokens) |
|---|---|
| Input | $0.2000 |
| Output | $0.8000 |
| Cached input | $0.0500 |
| Batch input | $0.1000 |
| Batch output | $0.4000 |
Last verified 2026-05-01 · OpenAI official pricing
How it compares
GPT-5 mini vs alternatives
Single-call cost (1000 input + 500 output tokens) ranked from cheapest.
| Model | Per call |
|---|---|
GPT-5 mini OpenAI · this page | $0.000600 |
| DeepSeek V4 DeepSeek | $0.000900 |
| Gemini 3.0 Flash Google | $0.001250 |
| o4-mini OpenAI | $0.002700 |
| Claude Haiku 4.5 Anthropic | $0.003500 |
| Mistral Large 3 Mistral | $0.006250 |
Recommended use
When to choose GPT-5 mini
GPT-5 mini shines for general-purpose tasks, image and document understanding, high-throughput, and low-latency tasks. Token counts on this page are exact via the official tokenizer.
✓Context window of 200K tokens handles long conversations and large documents.
✓Prompt caching available — significant savings for repeated system prompts.
✓Batch API support for non-realtime workloads at ~50% discount.
✓Tool / function calling supported.
FAQ
Frequently asked questions
GPT-5 mini costs $0.20 per 1M input tokens and $0.80 per 1M output tokens. A typical chat call (1000 input + 500 output tokens) costs approximately $0.0006. Use the calculator above to estimate your specific use case.
Related
Other models you might consider
DeepSeek
DeepSeek V4
DeepSeek's open MoE flagship — top open-weight performer
$0.30 in$1.20 out
Google
Gemini 3.0 Flash
Google's high-throughput multimodal model
$0.25 in$2.00 out
OpenAI
o4-mini
OpenAI's reasoning model — cheaper than o3
$0.90 in$3.60 out
Anthropic
Claude Haiku 4.5
Anthropic's fast affordable workhorse
$1.00 in$5.00 out
Mistral
Mistral Large 3
Mistral's flagship multilingual open-weight model
$2.50 in$7.50 out