MRCR 1M (pointwise)

reasoning

MRCR 1M (pointwise) is a variant of the Multi-Round Coreference Resolution benchmark that uses pointwise evaluation for ultra-long contexts (~1M tokens). This version evaluates each response independently rather than comparatively, testing models' absolute performance on long-context reasoning tasks.

Leaderboard

Showing 1 of 1 result

Gemini 2.5 Pro

82.9%

i