All models / GPT-4o mini

GPT-4o mini

openai · gpt-4o-mini · inferred pricing

Context 128k · Max output 16k · Released 2024-07-18 · Last verified 2026-04-28

Standard pricing (per 1M tokens)

Tier Input Output
Standard $0.15 $0.60

Prompt caching

Type Price / 1M
Cache type automatic
Cache read (hit) $0.07

Want to model your real cache savings? Try the cache calculator with this model selected.

Batch API

  • Input discount50%
  • Output discount50%
  • Stacks with cachingyes

See the batch decision tool for whether this fits your workload.

Capabilities

caching batch tool use vision extended thinking reasoning

Sources

Official pricing docs →

Found a price wrong? Report it