SciCode

coding

SciCode is a research coding benchmark curated by scientists that challenges language models to code solutions for scientific problems. It contains 338 subproblems decomposed from 80 challenging main problems across 16 natural science sub-fields including mathematics, physics, chemistry, biology, and materials science. Problems require knowledge recall, reasoning, and code synthesis skills.

Leaderboard

Showing 12 of 12 results

Gemini 3.1 Pro

59.0%

i
Qwen3.7 Max

53.5%

i
Kimi K2.6

52.2%

i
Kimi K2.5

48.7%

i
Kimi K2-Thinking-0905

44.8%

i
Nemotron 3 Super (120B A12B)

42.0%

i
GLM-4.5

41.7%

i
MiniMax M2.1

39.0%

i
Mercury 2

38.0%

i
GLM-4.5-Air

37.3%

i
MiniMax M2

36.0%

i
Nemotron 3 Nano (30B A3B)

33.3%

i