VideoHolmes
reasoning multimodal video
VideoHolmes evaluates video understanding and reasoning capabilities in multimodal models.
Methodology
Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: multimodal, reasoning, video. Language: en.