MRCR 128K (8-needle)

reasoning

MRCR (Multi-Round Coreference Resolution) at 128K context length with 8 needles. Models must navigate long conversations to reproduce specific model outputs, testing attention and reasoning across 128K-token contexts with 8 items to retrieve.

Leaderboard

Showing 2 of 2 results

Qwen3.7 Max

90.4%

i
MiniCPM-SALA

10.1%

i