SciCode
coding official site →
SciCode is a research coding benchmark curated by scientists that challenges language models to code solutions for scientific problems. It contains 338 subproblems decomposed from 80 challenging main problems across 16 natural science sub-fields including mathematics, physics, chemistry, biology, and materials science. Problems require knowledge recall, reasoning, and code synthesis skills.
Methodology
Imported from llm-stats public benchmark metadata. Modality: text. Max score: 1. Categories: biology, chemistry, code, math, physics, reasoning. Language: en. Verified by llm-stats: no.