Graphwalks parents >128k
reasoning
A graph reasoning benchmark that evaluates language models' ability to find parent nodes in graphs with context length over 128k tokens, testing long-context reasoning and graph structure understanding.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: long_context, reasoning, spatial_reasoning. Language: en. Verified by llm-stats: no.