OmniBench

reasoning

A novel multimodal benchmark designed to evaluate large language models' ability to recognize, interpret, and reason across visual, acoustic, and textual inputs simultaneously. Comprises 1,142 question-answer pairs covering 8 task categories from basic perception to complex inference, with a unique constraint that accurate responses require integrated understanding of all three modalities.

Leaderboard

Showing 1 of 1 result

Qwen2.5-Omni-7B

56.1%

i