GPT-4o mini
openai · gpt-4o-mini · inferred pricing
Context 128k · Max output 16k · Released 2024-07-18 · Last verified 2026-04-28
Standard pricing (per 1M tokens)
| Tier | Input | Output |
|---|---|---|
| Standard | $0.15 | $0.60 |
Prompt caching
| Type | Price / 1M |
|---|---|
| Cache type | automatic |
| Cache read (hit) | $0.07 |
Want to model your real cache savings? Try the cache calculator with this model selected.
Batch API
- Input discount50%
- Output discount50%
- Stacks with cachingyes
See the batch decision tool for whether this fits your workload.
Capabilities
caching batch tool use vision extended thinking reasoning
Sources
Found a price wrong? Report it