OpenAI MMLU

math

MMLU (Massive Multitask Language Understanding) is a comprehensive benchmark that measures a text model's multitask accuracy across 57 diverse academic and professional subjects. The test covers elementary mathematics, US history, computer science, law, morality, business ethics, clinical knowledge, and many other domains spanning STEM, humanities, social sciences, and professional fields. To attain high accuracy, models must possess extensive world knowledge and problem-solving ability.

Leaderboard

Showing 2 of 2 results

Gemma 3n E4B Instructed

35.6%

i
Gemma 3n E2B Instructed

22.3%

i