CoVoST2 en-zh

audio official site →

CoVoST 2 English-to-Chinese subset is part of the large-scale multilingual speech translation corpus derived from Common Voice. This subset focuses specifically on English to Chinese speech translation tasks within the broader CoVoST 2 dataset.

Methodology

Imported from llm-stats public benchmark metadata. Modality: audio. Max score: 100. Categories: audio, language, speech_to_text. Language: en. Verified by llm-stats: no.

Leaderboard

  1. Qwen2.5-Omni-7B self-reported llm-stats
    41.4%