SimpleVQA
multimodal
SimpleVQA is a visual question answering benchmark focused on simple queries.
Methodology
Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 100. Categories: general, image_to_text, multimodal, vision. Language: en. Verified by llm-stats: no.
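The metadata fields above could be held in a small typed record, which may help when comparing scores across benchmarks with different maximums. The following is a minimal sketch, not the llm-stats schema: the class name `BenchmarkMetadata`, its field names, and the `normalize` helper are all hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class BenchmarkMetadata:
    """Hypothetical container for the fields listed in the entry above."""
    name: str
    modality: str
    max_score: float
    categories: Tuple[str, ...]
    language: str
    verified: bool

    def normalize(self, raw_score: float) -> float:
        """Map a raw score onto [0, 1] relative to max_score."""
        if not 0 <= raw_score <= self.max_score:
            raise ValueError(
                f"score {raw_score} is outside [0, {self.max_score}]"
            )
        return raw_score / self.max_score


# Values copied from the metadata line; the structure itself is an assumption.
SIMPLEVQA = BenchmarkMetadata(
    name="SimpleVQA",
    modality="multimodal",
    max_score=100.0,
    categories=("general", "image_to_text", "multimodal", "vision"),
    language="en",
    verified=False,
)
```

As a usage illustration with a made-up score (not a reported result), `SIMPLEVQA.normalize(50.0)` returns `0.5`, and a score above 100 raises `ValueError`.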