AIME

math

The American Invitational Mathematics Examination (AIME) benchmark evaluates the mathematical reasoning capabilities of large language models. It contains 30 challenging problems from the AIME 2024 competition that require multi-step reasoning and advanced mathematical insight. Each problem has an integer answer from 000 to 999.
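Because every answer is an integer from 000 to 999, a response can be graded by exact match. The following is a minimal sketch of one way to do that, not this leaderboard's actual scoring code. The helper names `extract_answer` and `accuracy` are hypothetical, and taking the last standalone 1-3 digit number in the response as the final answer is an assumption.

```python
import re
from typing import List, Optional


def extract_answer(text: str) -> Optional[int]:
    """Return the last standalone 1-3 digit integer in `text`, or None.

    Assumption: the model states its final answer last. Leading zeros
    ("073") are accepted and normalized to the integer 73.
    """
    matches = re.findall(r"(?<!\d)\d{1,3}(?!\d)", text)
    return int(matches[-1]) if matches else None


def accuracy(predictions: List[str], references: List[int]) -> float:
    """Fraction of predictions whose extracted answer equals the reference."""
    if not references or len(predictions) != len(references):
        raise ValueError("predictions and references must be non-empty and equal in length")
    correct = sum(extract_answer(p) == r for p, r in zip(predictions, references))
    return correct / len(references)


if __name__ == "__main__":
    # Made-up responses for illustration only, not real AIME problems.
    preds = ["The answer is 073.", "So we get 5", "I think it is 12"]
    refs = [73, 5, 13]
    print(f"{accuracy(preds, refs):.1%}")
```

Comparing integers rather than strings means "073" and "73" count as the same answer. A response with no 1-3 digit number is scored as incorrect.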

Leaderboard


Phi 4 Mini Reasoning: 57.5%

MiMo-V2.5-Pro: 37.3%
