AIME 2024

math

American Invitational Mathematics Examination 2024, consisting of 30 challenging mathematical reasoning problems from AIME I and AIME II competitions. Each problem requires an integer answer between 0-999 and tests advanced mathematical reasoning across algebra, geometry, combinatorics, and number theory. Used as a benchmark for evaluating mathematical reasoning capabilities in large language models at Olympiad-level difficulty.

Leaderboard

Showing 20 of 54 results

Grok-3 Mini

95.8%

i
o4-mini

93.4%

i
Grok-3

93.3%

i
LongCat-Flash-Thinking

93.3%

i
Gemini 2.5 Pro

92.0%

i
o3

91.6%

i
DeepSeek-R1-0528

91.4%

i
GLM-4.5

91.0%

i
Ministral 3 (14B Reasoning 2512)

89.8%

i
GLM-4.5-Air

89.4%

i
Gemini 2.5 Flash

88.0%

i
o3-mini

87.3%

i
DeepSeek R1 Distill Llama 70B

86.7%

i
DeepSeek R1 Zero

86.7%

i
o1-pro

86.0%

i
MiniMax M1 80K

86.0%

i
Ministral 3 (8B Reasoning 2512)

86.0%

i
Qwen3 235B A22B

85.7%

i
MiniCPM-SALA

83.8%

i
o1

83.3%

i