MRCR v2

reasoning

MRCR v2 (Multi-Round Coreference Resolution, version 2) is a synthetic long-context reasoning benchmark. It extends the original MRCR task with revised evaluation criteria and added complexity, and it tests a model's ability to maintain attention and reason across extended contexts.

Leaderboard


Model                     Score
Gemma 4 31B               66.4%
Gemma 4 26B-A4B           44.1%
Gemma 4 12B               43.4%
DiffusionGemma 26B-A4B    32.0%
Gemma 4 E4B               25.4%
Gemma 4 E2B               19.1%
Gemini 2.5 Flash-Lite     16.6%