MATH (CoT)

math

MATH dataset contains 12,500 challenging competition mathematics problems from AMC 10, AMC 12, AIME, and other mathematics competitions. Each problem includes full step-by-step solutions and spans multiple difficulty levels (1-5) across seven mathematical subjects. This variant uses Chain-of-Thought prompting to encourage step-by-step reasoning.

Leaderboard

Showing 6 of 6 results

Llama 3.1 70B Instruct

68.0%

i
Ministral 3 (14B Base 2512)

67.6%

i
Mistral Large 3

67.6%

i
Ministral 3 (8B Base 2512)

62.6%

i
Ministral 3 (3B Base 2512)

60.1%

i
Llama 3.1 8B Instruct

51.9%

i