OlympiadBench

math official site →

A challenging benchmark for promoting AGI with Olympiad-level bilingual multimodal scientific problems. Comprises 8,476 math and physics problems from international and Chinese Olympiads and the Chinese college entrance exam, featuring expert-level annotations for step-by-step reasoning. Includes both text-only and multimodal problems in English and Chinese.

Methodology

Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: math, multimodal, physics, reasoning, vision. Language: en. Verified by llm-stats: no.

Leaderboard

  1. QvQ-72B-Preview self-reported llm-stats
    20.4%