IF

general

Instruction-Following Evaluation (IFEval) is a benchmark for large language models that focuses on verifiable instructions. It covers 25 instruction types across around 500 prompts, each containing one or more verifiable constraints.
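To make "verifiable" concrete, here is a minimal sketch of how such constraints can be checked programmatically. It is not the official IFEval code: the checker names (`min_words`, `no_commas`, `all_lowercase`), the helper `follows_all`, and the aggregation in `prompt_level_accuracy` are all hypothetical and chosen only for illustration.

```python
from typing import Callable, List

# Hypothetical checkers for illustration; these are NOT IFEval's own functions.
Check = Callable[[str], bool]


def min_words(n: int) -> Check:
    """Build a checker that passes when the response has at least n words."""
    return lambda response: len(response.split()) >= n


def no_commas(response: str) -> bool:
    """Pass when the response contains no comma characters."""
    return "," not in response


def all_lowercase(response: str) -> bool:
    """Pass when lowercasing the response leaves it unchanged."""
    return response == response.lower()


def follows_all(response: str, checks: List[Check]) -> bool:
    """A prompt counts as followed only if every attached constraint passes."""
    return all(check(response) for check in checks)


def prompt_level_accuracy(results: List[bool]) -> float:
    """Fraction of prompts whose constraints were all satisfied (assumed metric)."""
    return sum(results) / len(results) if results else 0.0
```

Because each constraint reduces to a deterministic check, a response can be scored without a human or model judge. Whether the percentages in the leaderboard use this exact aggregation is an assumption, not something the page states.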

Leaderboard


Mistral Small 3.2 24B Instruct: 84.8%

MiniMax M2: 72.0%