MM-ClawBench

coding

MM-ClawBench is MiniMax's Claw-style agent benchmark. It measures how well a model completes practical agentic tasks in real-world OpenClaw usage scenarios.

Methodology

Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: agents, coding. Language: en. Verified by llm-stats: no.

Leaderboard

  1. MiniMax M2.7: 62.7% (self-reported; source: llm-stats)
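
The metadata lists a max score of 1 while the leaderboard shows a percentage, so the two are presumably related by a simple rescaling. The sketch below shows one way to hold the metadata fields and do that conversion. The `BenchmarkMeta` class, the `to_percent` helper, and the raw value 0.627 are illustrative assumptions (the raw value is inferred from the 62.7% entry, not taken from llm-stats).

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class BenchmarkMeta:
    """Hypothetical record mirroring the metadata fields listed under Methodology."""
    name: str
    modality: str
    max_score: float
    categories: Tuple[str, ...]
    language: str
    verified: bool


def to_percent(raw_score: float, max_score: float) -> float:
    """Rescale a raw score in [0, max_score] to a percentage with one decimal."""
    if max_score <= 0:
        raise ValueError("max_score must be positive")
    if not 0 <= raw_score <= max_score:
        raise ValueError("raw_score must lie in [0, max_score]")
    return round(100.0 * raw_score / max_score, 1)


MM_CLAWBENCH = BenchmarkMeta(
    name="MM-ClawBench",
    modality="text",
    max_score=1.0,
    categories=("agents", "coding"),
    language="en",
    verified=False,  # "Verified by llm-stats: no"
)

# Assumed raw score: 0.627 is inferred from the 62.7% leaderboard entry.
percent = to_percent(0.627, MM_CLAWBENCH.max_score)
print(f"MiniMax M2.7: {percent}%")
```

Keeping `verified` as an explicit field makes it easy to filter out self-reported entries when comparing results.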