Infrastructure
Latency and price differences for the same model across providers. Sample snapshot only.
Prices on this page are listed in USD per 1M tokens unless a column says otherwise.
Same-model comparison
GPT-4o mini — Same-model comparison
GPT-4o mini — Official site
Providers are sorted by a balanced score: lower latency and lower average of input/output price per 1M rank higher; official badges break ties.
- 1
Groq
Fastest- Input / 1M
- $0.15
- Output / 1M
- $0.60
- P50 latency
- 200ms
- 2
- Input / 1M
- $0.15
- Output / 1M
- $0.60
- P50 latency
- 380ms
- 3
- Input / 1M
- $0.16
- Output / 1M
- $0.62
- P50 latency
- 310ms
- 4
- Input / 1M
- $0.15
- Output / 1M
- $0.60
- P50 latency
- 420ms
- 5
- Input / 1M
- $0.15
- Output / 1M
- $0.60
- P50 latency
- 450ms
- 6
- Input / 1M
- $0.18
- Output / 1M
- $0.64
- P50 latency
- 340ms