GDPval-MM

reasoning

GDPval-MM is the multimodal variant of the GDPval benchmark, evaluating AI model performance on real-world economically valuable tasks that require processing and generating multimodal content including documents, slides, diagrams, spreadsheets, images, and other professional deliverables across diverse industries.

Leaderboard

Showing 3 of 3 results

GPT-5.5

84.9%

i
GPT-5.5 Pro

82.3%

i
MiniMax M2.5

59.0%

i