VideoHolmes

reasoning multimodalvideo

VideoHolmes evaluates video understanding and reasoning capabilities in multimodal models.

Methodology

Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: multimodal, reasoning, video. Language: en.

Leaderboard

  1. 64.0%