GPT-4o

GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on English text and code, with improvements in non-English languages, vision, and audio understanding.

Benchmark	Score
MGSM	90.5%
HumanEval	90.2%
MMLU	88.7%
DROP	83.4%
MATH	76.6%
MMLU-Pro	72.6%
MathVista	63.8%
GPQA	53.6%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Context	Max output	Uptime	TTFT (p50)	Throughput (p50)	Notes
Azure	available	$5.00/Mtok	$15.00/Mtok	128K tokens	4K tokens	—	1,321 ms	38 tok/s
OpenAI	available	$5.00/Mtok	$15.00/Mtok	128K tokens	4K tokens	100.0% 5m 100.0%	471 ms	34 tok/s
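
To make the per-token prices and speed figures in the table concrete, here is a minimal Python sketch that estimates the cost and rough wall-clock time of a single request. The `ProviderQuote` class and both helper functions are hypothetical names introduced for illustration and are not part of any OpenRouter or OpenAI SDK. The latency formula (time to first token plus output tokens divided by throughput) is a simplifying assumption that treats the p50 figures as constant; real requests will vary.

```python
from dataclasses import dataclass

TOKENS_PER_MTOK = 1_000_000  # "Mtok" in the table means one million tokens


@dataclass(frozen=True)
class ProviderQuote:
    """One row of the provider table (hypothetical container, not an SDK type)."""
    name: str
    input_usd_per_mtok: float
    output_usd_per_mtok: float
    ttft_ms_p50: float        # median time to first token, in milliseconds
    tokens_per_s_p50: float   # median output throughput, in tokens per second


# Values copied from the table above.
AZURE = ProviderQuote("Azure", 5.00, 15.00, 1321, 38)
OPENAI = ProviderQuote("OpenAI", 5.00, 15.00, 471, 34)


def request_cost_usd(quote: ProviderQuote, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request: input and output tokens are billed at separate rates."""
    input_cost = input_tokens / TOKENS_PER_MTOK * quote.input_usd_per_mtok
    output_cost = output_tokens / TOKENS_PER_MTOK * quote.output_usd_per_mtok
    return input_cost + output_cost


def estimated_latency_s(quote: ProviderQuote, output_tokens: int) -> float:
    """Rough total time: TTFT plus generation time at the median throughput.

    Assumption: p50 values are treated as fixed, which ignores queueing,
    prompt length, and network variance.
    """
    return quote.ttft_ms_p50 / 1000 + output_tokens / quote.tokens_per_s_p50


if __name__ == "__main__":
    for quote in (AZURE, OPENAI):
        cost = request_cost_usd(quote, input_tokens=10_000, output_tokens=1_000)
        latency = estimated_latency_s(quote, output_tokens=1_000)
        print(f"{quote.name}: ${cost:.4f}, about {latency:.1f} s")
```

Under these assumptions the two providers cost the same per request, since their listed rates are identical. The lower TTFT favors OpenAI for short replies, while Azure's higher throughput catches up as the output grows longer.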