SimpleVQA

Categories: general, image to text, multimodal, vision
Modality: multimodal
Language: en
Multilingual: No
Max score: 100
Scoring: %, higher is better
Verified by llm-stats: No

SimpleVQA is a visual question answering benchmark focused on simple queries.

Leaderboard

Showing 10 of 10 results

Rank  Model                        Score
1     GLM-5V-Turbo                 78.2%
2     Muse Spark                   71.3%
3     Kimi K2.5                    71.2%
4     Qwen3.6 Plus                 67.3%
5     Qwen3.5-122B-A10B            61.7%
6     Qwen3 VL 235B A22B Thinking  61.3%
7     Qwen3.6-35B-A3B              58.9%
8     Qwen3.5-35B-A3B              58.3%
9     Qwen3.6-27B                  56.1%
10    Qwen3.5-27B                  56.0%
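
The "higher is better" ordering of the leaderboard can be reproduced with a short script. This is only a minimal sketch: the scores are copied from the listing above, while the names `SCORES`, `rank`, and `gap_to_leader` are illustrative choices and not part of any llm-stats tooling.

```python
# Scores copied from the SimpleVQA leaderboard (percent, max 100, higher is better).
SCORES = {
    "GLM-5V-Turbo": 78.2,
    "Muse Spark": 71.3,
    "Kimi K2.5": 71.2,
    "Qwen3.6 Plus": 67.3,
    "Qwen3.5-122B-A10B": 61.7,
    "Qwen3 VL 235B A22B Thinking": 61.3,
    "Qwen3.6-35B-A3B": 58.9,
    "Qwen3.5-35B-A3B": 58.3,
    "Qwen3.6-27B": 56.1,
    "Qwen3.5-27B": 56.0,
}

MAX_SCORE = 100.0  # taken from the "Max score" field


def rank(scores):
    """Return (position, model, score) tuples, best score first."""
    ordered = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [(pos, name, score) for pos, (name, score) in enumerate(ordered, start=1)]


def gap_to_leader(scores):
    """Return each model's distance, in percentage points, from the top score."""
    best = max(scores.values())
    return {name: round(best - score, 1) for name, score in scores.items()}


if __name__ == "__main__":
    # Sanity check: every score must fall within the benchmark's stated range.
    assert all(0.0 <= s <= MAX_SCORE for s in SCORES.values())
    gaps = gap_to_leader(SCORES)
    for position, name, score in rank(SCORES):
        print(f"{position:>2}  {name:<28} {score:5.1f}%  (-{gaps[name]:.1f})")
```

Sorting in descending order reflects the "higher is better" scoring rule; for a benchmark where lower is better, `reverse=True` would be dropped.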

Content licensed CC BY-SA 4.0.