AIME 2025

math

This benchmark comprises all 30 problems from the 2025 American Invitational Mathematics Examination (AIME I and AIME II), which test olympiad-level mathematical reasoning and have integer answers from 000 to 999. It is used to evaluate how well large language models solve complex mathematical problems that require multi-step logical deduction and structured symbolic reasoning.
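
Because every answer is an integer from 000 to 999, a score on this kind of benchmark can be computed by exact match. The sketch below is only an illustration of that idea, not the leaderboard's actual grading code: the function names (`parse_aime_answer`, `accuracy`, `mean_accuracy`) are hypothetical, and averaging over repeated runs is an assumption. A single pass over 30 problems can only yield multiples of 1/30, so fractional scores such as 99.8% suggest some form of averaging, but the page does not say how scores are aggregated.

```python
import re
from typing import Optional, Sequence


def parse_aime_answer(text: str) -> Optional[int]:
    """Return the answer as an int in 0..999, or None if it is not a valid AIME answer."""
    s = text.strip()
    # Accept one to three ASCII digits, so "042" and "42" both parse to 42.
    if not re.fullmatch(r"[0-9]{1,3}", s):
        return None
    return int(s)


def accuracy(predictions: Sequence[str], reference: Sequence[int]) -> float:
    """Percentage of predictions that exactly match the reference answers."""
    if len(predictions) != len(reference):
        raise ValueError("predictions and reference must have the same length")
    # An unparseable prediction yields None, which never equals a reference int.
    correct = sum(parse_aime_answer(p) == r for p, r in zip(predictions, reference))
    return 100.0 * correct / len(reference)


def mean_accuracy(runs: Sequence[Sequence[str]], reference: Sequence[int]) -> float:
    """Average accuracy over several independent runs (an assumed aggregation)."""
    if not runs:
        raise ValueError("at least one run is required")
    return sum(accuracy(run, reference) for run in runs) / len(runs)
```

With the real benchmark, `reference` would hold the 30 official answers and each run would hold the model's 30 final answers as strings.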

Leaderboard

Showing 20 of 114 results

Model                         Score
Grok-4 Heavy                  100.0%
Gemini 3 Pro                  100.0%
GPT-5.2                       100.0%
GPT-5.2 Pro                   100.0%
Kimi K2-Thinking-0905         100.0%
Claude Opus 4.6               99.8%
Gemini 3 Flash                99.7%
GPT-5.1 High                  99.6%
LongCat-Flash-Thinking-2601   99.6%
Nemotron 3 Nano (30B A3B)     99.2%
GPT OSS 20B High              98.7%
GPT-5.1 Medium                98.4%
Seed 2.0 Pro                  98.3%
Step-3.5-Flash                97.3%
MAI-Thinking-1                97.0%
GPT-5.1 Codex High            96.7%
Sarvam-105B                   96.7%
Sarvam-30B                    96.7%
Kimi K2.5                     96.1%
DeepSeek-V3.2-Speciale        96.0%