LOCA-Bench (256k)

reasoning

LOCA-Bench is a long-context agentic benchmark. The 256k variant evaluates agents using the official ReAct mode with an environment description length of 256k tokens, measuring how well models reason and act over very long contexts.

Leaderboard

Showing 1 of 1 result

MiniMax M3

49.3%

i