MM-BrowserComp

multimodal

MM-BrowserComp evaluates multimodal agents on web browsing and information retrieval tasks, testing a model's ability to perceive, navigate, and extract information from real web environments.

Methodology

Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: agents, multimodal, search. Language: en. Verified by llm-stats: no.

Leaderboard

  1. MiMo-V2-Omni self-reported llm-stats
    52.0%