o4-mini

o4-mini is OpenAI's latest small o-series model, optimized for fast, effective reasoning with exceptionally efficient performance in coding and visual tasks. It is faster and more affordable than o3.

Benchmark results

Benchmark Score Tags Source
Aider-Polyglot 68.9% self-reported llm-stats link →
Aider-Polyglot Edit 58.2% self-reported llm-stats link →
AIME 2024 93.4% self-reported llm-stats link →
AIME 2025 92.7% self-reported llm-stats link →
BrowseComp 51.5% self-reported llm-stats link →
CharXiv-R 72.0% self-reported llm-stats link →
GPQA 81.4% self-reported llm-stats link →
Humanity's Last Exam 14.7% self-reported llm-stats link →
MathVista 84.3% self-reported llm-stats link →
MMMU 81.6% self-reported llm-stats link →
Scale MultiChallenge 43.0% self-reported llm-stats link →
SWE-Bench Verified 68.1% self-reported llm-stats link →
TAU-bench Airline 49.2% self-reported llm-stats link →
TAU-bench Retail 71.8% self-reported llm-stats link →