VCR_en_easy

reasoning

Visual Commonsense Reasoning (VCR) benchmark that tests higher-order cognition and commonsense reasoning beyond simple object recognition. Models must answer challenging questions about images and provide rationales justifying their answers. The benchmark measures the ability to infer people's actions, goals, and mental states from visual context.

Leaderboard

Showing 1 of 1 result

Qwen2-VL-72B-Instruct

91.9%

i