MultiLF

general

MultiLF benchmark

Methodology

Imported from llm-stats public benchmark metadata; not verified by llm-stats. Modality: text. Max score: 1. Categories: general. Language: English (en).

Leaderboard

  1. Qwen3 32B (self-reported, llm-stats): 73.0%
  2. Qwen3 235B A22B (self-reported, llm-stats): 71.9%
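
The leaderboard shows percentages, while the methodology line gives a max score of 1. A minimal sketch of how the two scales might relate, assuming (this is not stated in the source) that a displayed percentage is the raw score divided by the max score, times 100. The names `LEADERBOARD`, `MAX_SCORE`, and `to_raw_score` are hypothetical and exist only for this illustration.

```python
# Leaderboard rows as (model, displayed percent), copied from the list above.
LEADERBOARD = [
    ("Qwen3 32B", 73.0),
    ("Qwen3 235B A22B", 71.9),
]

MAX_SCORE = 1.0  # "Max score: 1" from the methodology line


def to_raw_score(percent: float, max_score: float = MAX_SCORE) -> float:
    """Convert a displayed percentage back to the raw score scale.

    Assumption (not confirmed by the source):
    displayed percent = raw score / max score * 100.
    """
    return percent / 100.0 * max_score


# Raw scores on the assumed 0..1 scale, keyed by model name.
raw_scores = {model: to_raw_score(pct) for model, pct in LEADERBOARD}

# Gap between the two entries, in percentage points, rounded to one
# decimal to avoid floating-point noise.
gap_points = round(LEADERBOARD[0][1] - LEADERBOARD[1][1], 1)

if __name__ == "__main__":
    for model, score in raw_scores.items():
        print(f"{model}: {score:.3f}")
    print(f"gap: {gap_points} points")
```

Both results are self-reported and the benchmark is unverified, so a gap of this size is best read as indicative rather than as a confirmed ranking.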