MRCR

reasoning

MRCR (Multi-Round Coreference Resolution) is a synthetic long-context reasoning task where models must navigate long conversations to reproduce specific model outputs. It tests the ability to distinguish between similar requests and reason about ordering while maintaining attention across extended contexts.

Leaderboard

Showing 7 of 7 results

Gemini 2.5 Pro

93.0%

i
Gemini 1.5 Pro

82.6%

i
Gemini 1.5 Flash

71.9%

i
Gemini 2.0 Flash

69.2%

i
Gemini 1.5 Flash 8B

54.7%

i
MiMo-V2-Flash

45.7%

i
Gemini 2.5 Flash

32.0%

i