MMLU-STEM

math

STEM-focused subset of the Massive Multitask Language Understanding benchmark, evaluating language models on science, technology, engineering, and mathematics topics including physics, chemistry, mathematics, and other technical subjects.

Leaderboard

Showing 2 of 2 results

Qwen2.5 32B Instruct

80.9%

i
Qwen2.5 14B Instruct

76.4%

i