DeepSeek R1 Distill Llama 70B

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

Benchmark results

Benchmark Score Tags Source
AIME 2024 86.7% self-reported llm-stats link →
GPQA 65.2% self-reported llm-stats link →
LiveCodeBench 57.5% self-reported llm-stats link →
MATH-500 94.5% self-reported llm-stats link →