InfographicsQA


InfographicsQA is based on the InfographicVQA dataset, which contains 5,485 infographic images and over 30,000 questions. Answering them requires joint reasoning over document layout, textual content, graphical elements, and data visualizations, along with elementary reasoning and arithmetic skills.

Methodology

Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: multimodal, vision. Language: en. Verified by llm-stats: no.

Leaderboard

  1. Llama 3.2 90B Instruct (self-reported, via llm-stats): 56.8%
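
The metadata gives a maximum score of 1, while the leaderboard shows a percentage. A minimal sketch of how the two might relate, assuming the leaderboard value is simply the raw score divided by the maximum score and shown to one decimal place. The function name `to_percent` and the raw value 0.568 are illustrative assumptions, not taken from llm-stats.

```python
def to_percent(score: float, max_score: float = 1.0) -> str:
    """Format a raw benchmark score as a percentage of max_score.

    Assumption: the leaderboard percentage is score / max_score * 100,
    rounded to one decimal place.
    """
    if max_score <= 0:
        raise ValueError("max_score must be positive")
    if not 0 <= score <= max_score:
        raise ValueError("score must lie in [0, max_score]")
    return f"{100 * score / max_score:.1f}%"


# Hypothetical raw score that would correspond to the listed entry.
print(to_percent(0.568))
```

Under this assumption, a raw score of 0.568 on a 0 to 1 scale would be displayed as the 56.8% listed for Llama 3.2 90B Instruct.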