Qwen3-Next-80B-A3B-Thinking

Qwen3-Next-80B-A3B-Thinking is the thinking variant of the Qwen3-Next series, featuring the same groundbreaking architecture as the instruct model. Leveraging GSPO, it addresses stability and efficiency challenges of hybrid attention + high-sparsity MoE in RL training.

MMLU-Redux

92.5%

i
IFEval

88.9%

i
AIME 2025

87.8%

i
WritingBench

84.6%

i
MMLU-Pro

82.7%

i
Include

78.9%

i
MMLU-ProX

78.7%

i
Multi-IF

77.8%

i
GPQA

77.2%

i
LiveBench 20241125

76.6%

i
HMMT25

73.9%

i
BFCL-v3

72.0%

i
TAU-bench Retail

69.6%

i
LiveCodeBench v6

68.7%

i
Tau2 Retail

67.8%

i
Arena-Hard v2

62.3%

i
SuperGPQA

60.8%

i
Tau2 Airline

60.5%

i
PolyMATH

56.3%

i
TAU-bench Airline

49.0%

i
Tau2 Telecom

43.9%

i
OJBench

29.7%

i
CFEval

2,071

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Alibaba	available	$0.10/Mtok	$0.78/Mtok	131K tokens context 33K tokens max output	—	383 ms p50 TTFT 205 tok/s p50
Google	available	$0.15/Mtok	$1.20/Mtok	262K tokens context 262K tokens max output	—	371 ms p50 TTFT 40 tok/s p50