DeepSeek R1 Distill Llama 70B

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

MATH-500

94.5%

i
AIME 2024

86.7%

i
GPQA

65.2%

i
LiveCodeBench

57.5%

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Novita	available	$0.80/Mtok	$0.80/Mtok	8K tokens context 8K tokens max output	100.0%	993 ms p50 TTFT 18 tok/s p50	bf16