MRCR 128K (4-needle)

reasoning

MRCR (Multi-Round Coreference Resolution) at 128K context length with 4 needles. Models must navigate long conversations to reproduce specific model outputs, testing attention and reasoning across 128K-token contexts with 4 items to retrieve.

Leaderboard

Showing 1 of 1 result

MiniCPM-SALA

19.6%

i