Qwen2.5 7B Instruct

Qwen2.5-7B-Instruct is an instruction-tuned 7B parameter language model that excels at following instructions, generating long texts (over 8K tokens), understanding structured data, and generating structured outputs like JSON. The model features enhanced capabilities in mathematics, coding, and multilingual support across 29+ languages including Chinese, English, French, Spanish, and more.

GSM8k

91.6%

i
HumanEval

84.8%

i
MBPP

79.2%

i
MATH

75.5%

i
MMLU-Redux

75.4%

i
AlignBench

73.3%

i
IFEval

71.2%

i
MultiPL-E

70.4%

i
MMLU-Pro

56.3%

i
Arena Hard

52.0%

i
GPQA

36.4%

i
LiveBench

35.9%

i
LiveCodeBench

28.7%

i
MT-Bench

87.5

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Phala	available	$0.04/Mtok	$0.10/Mtok	33K tokens context 33K tokens max output	100.0% 5m 100.0%	515 ms p50 TTFT 42 tok/s p50
Together	available	$0.30/Mtok	$0.30/Mtok	33K tokens context 2K tokens max output	100.0% 5m 100.0%	279 ms p50 TTFT 66 tok/s p50	fp8