ERQA
reasoning official site →
Embodied Reasoning Question Answering benchmark consisting of 400 multiple-choice visual questions across spatial reasoning, trajectory reasoning, action reasoning, state estimation, and multi-view reasoning for evaluating AI capabilities in physical world interactions
Methodology
Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: reasoning, spatial_reasoning, vision. Language: en. Verified by llm-stats: no.