Kernel Bench L3

coding

Kernel Bench L3 evaluates agentic GPU kernel optimization across 50 problems. Qwen reports two metrics for this benchmark: the median per-problem speedup over the PyTorch eager reference, and the fraction of problems on which the optimized kernel runs faster than torch.compile.
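The two metrics can be illustrated with a small sketch. It is not the benchmark's actual harness: the function name `summarize`, the per-problem timing lists, and the choice of a strict "faster than" comparison are all assumptions made for illustration, and the timings in the usage example are made up.

```python
from statistics import median


def summarize(eager_ms, compile_ms, kernel_ms):
    """Return (median speedup over eager, fraction faster than torch.compile).

    Each argument is a list of per-problem runtimes in milliseconds, in the
    same problem order. These inputs are hypothetical; how the benchmark
    actually measures runtimes is not stated in the description.
    """
    if not (len(eager_ms) == len(compile_ms) == len(kernel_ms)):
        raise ValueError("all timing lists must cover the same problems")
    if not kernel_ms:
        raise ValueError("at least one problem is required")

    # Speedup over eager for one problem: eager runtime / kernel runtime.
    speedups = [e / k for e, k in zip(eager_ms, kernel_ms)]

    # Assumption: a problem counts only if the kernel is strictly faster
    # than the torch.compile baseline.
    faster = sum(k < c for k, c in zip(kernel_ms, compile_ms))

    return median(speedups), faster / len(kernel_ms)


# Made-up timings for four problems, for illustration only.
med, frac = summarize(
    eager_ms=[10.0, 8.0, 6.0, 4.0],
    compile_ms=[5.0, 4.0, 3.0, 2.0],
    kernel_ms=[2.0, 4.0, 2.0, 4.0],
)
print(f"median speedup: {med:.2f}x, faster than torch.compile: {frac:.1%}")
```

The median is less sensitive than the mean to a few problems with very large speedups, which may be one reason to report it per problem rather than as an aggregate.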

Leaderboard


Qwen3.7 Max

96.0%
