Qwen2.5-Coder 7B Instruct

Qwen2.5-Coder is a specialized coding model trained on 5.5 trillion tokens of code data, supporting 92 programming languages with a 128K context window. It excels in code generation, completion, and repair while maintaining strong performance in math and general tasks.

HumanEval

88.4%

i
GSM8k

83.9%

i
MBPP

83.5%

i
HellaSwag

76.8%

i
Winogrande

72.9%

i
MMLU-Base

68.0%

i
MMLU

67.6%

i
MMLU-Redux

66.6%

i
ARC-C

60.9%

i
CRUXEval-Input-CoT

56.5%

i
CRUXEval-Output-CoT

56.0%

i
Aider

55.6%

i
TruthfulQA

50.6%

i
MATH

46.6%

i
BigCodeBench

41.0%

i
MMLU-Pro

40.1%

i
STEM

34.0%

i
TheoremQA

34.0%

i
LiveCodeBench

18.2%

i