OCRBench-V2 (en)
vision official site →
OCRBench v2 English subset: Enhanced benchmark for evaluating Large Multimodal Models on visual text localization and reasoning with English text content
Methodology
Imported from llm-stats public benchmark metadata. Modality: multimodal. Max score: 1. Categories: image_to_text, vision. Language: en. Verified by llm-stats: no.