MME-RealWorld

multimodal

A comprehensive evaluation benchmark for Multimodal Large Language Models featuring over 13,366 high-resolution images and 29,429 question-answer pairs across 43 subtasks and 5 real-world scenarios. The largest manually annotated multimodal benchmark to date, designed to test MLLMs on challenging high-resolution real-world scenarios.

Leaderboard

Showing 1 of 1 result

Qwen2.5-Omni-7B

61.6%

i