NMOS

general

NMOS evaluation benchmark for assessing model performance on specialized tasks

Methodology

Imported from llm-stats public benchmark metadata. Modality: text. Max score: 100. Categories: general. Language: en. Verified by llm-stats: no.

Leaderboard

  1. Qwen2.5-Omni-7B self-reported llm-stats
    4.5%