CVTG-2K

vision

CVTG-2K (Complex Visual Text Generation 2K) is a benchmark that evaluates how accurately text-to-image models render text within generated images. It reports Word Accuracy, Normalized Edit Distance (NED), and CLIPScore over 2,000 prompts.
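As a rough illustration of the two text-matching metrics named above, the sketch below computes a word-level accuracy and an edit-distance-based similarity on strings that an OCR step is assumed to have already extracted from a generated image. The function names, the exact-match rule, and the choice to report NED as a similarity (1 minus the edit distance divided by the longer string's length) are assumptions made for illustration; the benchmark's official scoring may normalize or aggregate differently. CLIPScore is left out because it requires a pretrained image-text model.

```python
from typing import Sequence


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance (insert, delete, substitute)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def ned_similarity(target: str, predicted: str) -> float:
    """Assumed convention: 1 - distance / max length, so 1.0 is a perfect match."""
    longest = max(len(target), len(predicted))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(target, predicted) / longest


def word_accuracy(targets: Sequence[str], predictions: Sequence[str]) -> float:
    """Assumed convention: fraction of target words reproduced exactly."""
    if not targets:
        return 0.0
    hits = sum(1 for t, p in zip(targets, predictions) if t == p)
    return hits / len(targets)


# Hypothetical example: the prompt asked for "OPEN" and "SALE",
# and OCR on the generated image read "OPEN" and "SALF".
targets = ["OPEN", "SALE"]
predictions = ["OPEN", "SALF"]
print(word_accuracy(targets, predictions))
print([ned_similarity(t, p) for t, p in zip(targets, predictions)])
```

Under these assumed conventions both metrics fall in the range 0 to 1, with higher values indicating more faithful text rendering.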

Methodology

Imported from llm-stats public benchmark metadata. Modality: image. Max score: 1. Categories: image-generation, language, vision. Language: zh. Verified by llm-stats: no.

Leaderboard

No results yet.