GPT-4o

GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on English text and code, with improvements in non-English languages, vision, and audio understanding.

Benchmark	Score
AI2D	94.2%
DocVQA	92.8%
ChartQA	85.7%
MMLU	85.7%
CharXiv-D	85.3%
MMMLU	81.4%
IFEval	81.0%
MMLU-Pro	74.7%
EgoSchema	72.2%
MMMU	72.2%
GPQA	70.1%
ComplexFuncBench	66.5%
Tau2 Retail	63.4%
ActivityNet	61.9%
MathVista	61.4%
VideoMMMU	61.2%
COLLIE	61.0%
Multi-IF	60.9%
TAU-bench Retail	60.3%
MMMU-Pro	59.9%
CharXiv-R	58.8%
Tau2 Airline	45.5%
TAU-bench Airline	42.8%
Graphwalks BFS <128k	41.7%
Multi-Challenge	40.3%
Scale MultiChallenge	40.3%
MultiChallenge (o3-mini grader)	39.9%
SimpleQA	38.2%
Graphwalks parents <128k	35.4%
ERQA	35.2%
SWE-Bench Verified	33.2%
SWE-Lancer	32.6%
OpenAI-MRCR: 2 needle 128k	31.9%
Aider-Polyglot	30.7%
Internal API instruction following (hard)	29.2%
Tau2 Telecom	23.5%
Aider-Polyglot Edit	18.2%
AIME 2024	13.1%
SWE-Lancer (IC-Diamond subset)	12.4%
Humanity's Last Exam	5.3%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Azure	available	$2.50/Mtok (cache: $1.25/Mtok)	$10.00/Mtok	128K tokens context, 16K tokens max output	100.0% (5m: 100.0%)	863 ms p50 TTFT, 70 tok/s p50
OpenAI	available	$2.50/Mtok (cache: $1.25/Mtok)	$10.00/Mtok	128K tokens context, 16K tokens max output	100.0% (5m: 100.0%)	933 ms p50 TTFT, 45 tok/s p50
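
The table's per-token rates and speed figures can be turned into rough per-request estimates. The Python sketch below is illustrative only: the `ProviderQuote` class and both helper functions are hypothetical names, not part of any provider SDK. It assumes that the "cache" rate replaces the normal input rate for the cached portion of a prompt, and that response time is approximately the p50 time to first token plus output tokens divided by p50 throughput. Actual billing and latency may differ.

```python
from dataclasses import dataclass

TOKENS_PER_MTOK = 1_000_000  # "Mtok" in the table means one million tokens


@dataclass(frozen=True)
class ProviderQuote:
    """Hypothetical container for one row of the pricing table."""
    name: str
    input_usd_per_mtok: float
    cached_input_usd_per_mtok: float
    output_usd_per_mtok: float
    ttft_ms_p50: float        # median time to first token, in milliseconds
    tokens_per_s_p50: float   # median generation throughput


# Values copied from the table above.
AZURE = ProviderQuote("Azure", 2.50, 1.25, 10.00, 863, 70)
OPENAI = ProviderQuote("OpenAI", 2.50, 1.25, 10.00, 933, 45)


def estimate_cost_usd(quote: ProviderQuote, input_tokens: int,
                      output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimate request cost in USD.

    Assumption: cached input tokens are billed at the cache rate
    instead of the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * quote.input_usd_per_mtok
             + cached_tokens * quote.cached_input_usd_per_mtok
             + output_tokens * quote.output_usd_per_mtok)
    return total / TOKENS_PER_MTOK


def estimate_latency_s(quote: ProviderQuote, output_tokens: int) -> float:
    """Rough median response time: TTFT plus generation time (an approximation)."""
    return quote.ttft_ms_p50 / 1000 + output_tokens / quote.tokens_per_s_p50


if __name__ == "__main__":
    for quote in (AZURE, OPENAI):
        cost = estimate_cost_usd(quote, input_tokens=100_000,
                                 output_tokens=10_000, cached_tokens=40_000)
        latency = estimate_latency_s(quote, output_tokens=500)
        print(f"{quote.name}: ~${cost:.2f}, ~{latency:.1f} s for 500 output tokens")
```

Both providers list identical token prices, so under these assumptions the cost estimate is the same for each and only the latency estimate differs.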