CVTG-2K
vision
CVTG-2K (Complex Visual Text Generation 2K) is a benchmark that evaluates how accurately text-to-image models render text within generated images. It reports three metrics across 2,000 prompts: Word Accuracy, Normalized Edit Distance (NED), and CLIPScore.
Methodology
Imported from llm-stats public benchmark metadata. Modality: image. Max score: 1. Categories: image-generation, language, vision. Language: zh. Verified by llm-stats: no.
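The two text-rendering metrics named in the description can be illustrated with a minimal sketch. It is not the benchmark's official scoring code. It assumes that an OCR step (not shown) has already extracted the recognized strings, that target and recognized words are paired by position, and that NED is reported as a similarity in [0, 1] where higher is better, which is consistent with the maximum score of 1. The function names `levenshtein`, `ned_similarity`, and `word_accuracy` are hypothetical.

```python
from typing import Sequence


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions, and substitutions."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def ned_similarity(target: str, recognized: str) -> float:
    """Assumed convention: 1 - distance / max length, so 1.0 is an exact match."""
    longest = max(len(target), len(recognized))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(target, recognized) / longest


def word_accuracy(targets: Sequence[str], recognized: Sequence[str]) -> float:
    """Fraction of target words reproduced exactly (assumed position-aligned)."""
    if not targets:
        return 0.0
    hits = sum(t == r for t, r in zip(targets, recognized))
    return hits / len(targets)
```

For example, if a prompt asks for the words "OPEN", "24", and "HOURS" and the OCR output reads "OPEN", "24", and "HOUR", the sketch scores two of three words as exact matches, while `ned_similarity("HOURS", "HOUR")` still gives partial credit for the near miss.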
Leaderboard
No results yet.