PinchBench

coding

PinchBench evaluates coding agents on real-world agentic coding tasks, measuring both best-case and average performance across complex software engineering scenarios.

Leaderboard

Showing 3 of 3 results

MiMo-V2-Omni

81.2%

i
MiMo-V2-Pro

81.0%

i
GLM-5V-Turbo

80.7%

i