OmniMath

math

A universal Olympiad-level mathematics benchmark for large language models. It contains 4,428 competition-level problems with rigorous human annotation, categorized into more than 33 sub-domains and spanning more than 10 distinct difficulty levels.

Leaderboard


Phi 4 Reasoning Plus

81.9%

Phi 4 Reasoning

76.6%
