AA-Index

general

No official academic documentation was found for this benchmark. Searches of arXiv, IEEE, ACL, and NeurIPS papers and of university research sites turned up no peer-reviewed sources for an 'aa-index' benchmark. This entry requires verification against official academic sources.

Methodology

Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: general. Language: en. Verified by llm-stats: no.

Leaderboard

  1. GLM-4.5 (self-reported, llm-stats): 67.7%
  2. GLM-4.5-Air (self-reported, llm-stats): 64.8%
  3. MiniMax M2 (self-reported, llm-stats): 61.0%