MATH

math

MATH dataset contains 12,500 challenging competition mathematics problems from AMC 10, AMC 12, AIME, and other mathematics competitions. Each problem includes full step-by-step solutions and spans multiple difficulty levels (1-5) across seven mathematical subjects including Prealgebra, Algebra, Number Theory, Counting and Probability, Geometry, Intermediate Algebra, and Precalculus.

Leaderboard

Showing 20 of 75 results

o3-mini

97.9%

i
o1

96.4%

i
o1

94.8%

i
Mistral Large 3

90.4%

i
MiniStral 3 (14B Instruct 2512)

90.4%

i
Gemini 2.0 Flash

89.7%

i
Kimi K2 0905

89.1%

i
Gemma 3 27B

89.0%

i
Ministral 3 (8B Instruct 2512)

87.6%

i
Gemini 2.0 Flash-Lite

86.8%

i
Gemini 1.5 Pro

86.5%

i
MiMo-V2.5-Pro

86.2%

i
o1-preview

85.5%

i
GPT-5

84.7%

i
Gemma 3 12B

83.8%

i
Qwen2.5 32B Instruct

83.1%

i
Qwen2.5 72B Instruct

83.1%

i
Ministral 3 (3B Instruct 2512)

83.0%

i
Qwen2.5 VL 32B Instruct

82.2%

i
Phi 4

80.4%

i