o4-mini
o4-mini is OpenAI's latest small o-series model, optimized for fast, effective reasoning with exceptionally efficient performance in coding and visual tasks. It is faster and more affordable than o3.
Benchmark results
| Benchmark | Score | Tags | Source |
|---|---|---|---|
| Aider-Polyglot | 68.9% | self-reported llm-stats | link → |
| Aider-Polyglot Edit | 58.2% | self-reported llm-stats | link → |
| AIME 2024 | 93.4% | self-reported llm-stats | link → |
| AIME 2025 | 92.7% | self-reported llm-stats | link → |
| BrowseComp | 51.5% | self-reported llm-stats | link → |
| CharXiv-R | 72.0% | self-reported llm-stats | link → |
| GPQA | 81.4% | self-reported llm-stats | link → |
| Humanity's Last Exam | 14.7% | self-reported llm-stats | link → |
| MathVista | 84.3% | self-reported llm-stats | link → |
| MMMU | 81.6% | self-reported llm-stats | link → |
| Scale MultiChallenge | 43.0% | self-reported llm-stats | link → |
| SWE-Bench Verified | 68.1% | self-reported llm-stats | link → |
| TAU-bench Airline | 49.2% | self-reported llm-stats | link → |
| TAU-bench Retail | 71.8% | self-reported llm-stats | link → |