CMMLU

reasoning

CMMLU (Chinese Massive Multitask Language Understanding) is a Chinese-language benchmark that evaluates the knowledge and reasoning capabilities of large language models across 67 subjects. It covers the natural sciences, social sciences, engineering, and humanities, using multiple-choice questions that range in difficulty from elementary to advanced professional level.
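As a rough illustration of how a percentage score on a multiple-choice benchmark can be computed, the sketch below assumes the score is plain accuracy: the share of questions for which the model's chosen letter matches the answer key. The `Question` class, the `accuracy` function, and the sample items are hypothetical and are not taken from the CMMLU dataset or its evaluation code.

```python
from dataclasses import dataclass


@dataclass
class Question:
    """One multiple-choice item (hypothetical structure, not the CMMLU schema)."""
    prompt: str
    choices: list   # answer options, in order A, B, C, D
    answer: str     # letter of the correct option, e.g. "B"


def accuracy(questions, predictions):
    """Return the percentage of predictions that match the answer key."""
    if len(questions) != len(predictions):
        raise ValueError("need exactly one prediction per question")
    if not questions:
        raise ValueError("cannot score an empty question set")
    correct = sum(
        1
        for q, p in zip(questions, predictions)
        if p.strip().upper() == q.answer.upper()
    )
    return 100.0 * correct / len(questions)


# Made-up placeholder items, for illustration only.
sample = [
    Question("2 + 3 = ?", ["4", "5", "6", "7"], "B"),
    Question("Which is a noble gas?", ["Neon", "Iron", "Oxygen", "Sodium"], "A"),
    Question("10 / 2 = ?", ["2", "3", "4", "5"], "D"),
    Question("Which is a prime number?", ["4", "6", "7", "9"], "C"),
]
preds = ["B", "A", "C", "c"]  # hypothetical model outputs; the third is wrong

print(f"{accuracy(sample, preds):.1f}%")
```

Normalizing case and whitespace before comparing is a design choice in this sketch; a real evaluation harness may extract and match answers differently.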

Leaderboard


Model                 Score
MiMo-V2.5-Pro         90.2%
Qwen2 72B Instruct    90.1%
LongCat-Flash-Chat    84.3%
LongCat-Flash-Lite    82.5%
MiniCPM-SALA          81.5%
ERNIE 4.5             39.8%