Natural2Code

reasoning

NaturalCodeBench (NCB) is a challenging code benchmark designed to mirror the complexity and variety of real-world coding tasks. It comprises 402 high-quality problems in Python and Java, selected from natural user queries from online coding services, covering 6 different domains.

Leaderboard

Showing 8 of 8 results

Gemini 2.0 Flash

92.9%

i
Gemini 1.5 Pro

85.4%

i
Gemma 3 27B

84.5%

i
Gemma 3 12B

80.7%

i
Gemini 1.5 Flash

79.8%

i
Gemini 1.5 Flash 8B

75.5%

i
Gemma 3 4B

70.3%

i
Gemma 3 1B

56.0%

i