Qwen3.6-35B-A3B

Qwen3.6-35B-A3B is the first open-weight variant of the Qwen3.6 series, a multimodal Mixture-of-Experts model with 35B total parameters and 3B activated. It pairs a vision encoder with a hybrid 40-layer language model that interleaves Gated DeltaNet linear-attention blocks and Gated Attention blocks (10 × (3 × DeltaNet + 1 × Attention)) over 256 experts (8 routed + 1 shared, expert dim 512).

MMLU-Redux

93.3%

i
MMBench-V1.1

92.8%

i
AI2D

92.7%

i
AIME 2026

92.7%

i
RefCOCO-avg

92.0%

i
HMMT 2025

90.7%

i
C-Eval

90.0%

i
OmniDocBench 1.5

89.9%

i
HMMT25

89.1%

i
VideoMME w sub.

86.6%

i
MathVista-Mini

86.4%

i
MLVU

86.2%

i
GPQA

86.0%

i
RealWorldQA

85.3%

i
MMLU-Pro

85.2%

i
EmbSpatialBench

84.3%

i
VideoMMMU

83.7%

i
HMMT Feb 26

83.6%

i
VideoMME w/o sub.

82.5%

i
CC-OCR

81.9%

i
MMMU

81.7%

i
LiveCodeBench v6

80.4%

i
IMO-AnswerBench

78.9%

i
CharXiv-R

78.0%

i
MMMU-Pro

75.3%

i
MVBench

74.6%

i
SWE-Bench Verified

73.4%

i
LVBench

71.4%

i
Hallusion Bench

69.8%

i
SWE-bench Multilingual

67.2%

i
TAU3-Bench

67.2%

i
SuperGPQA

64.7%

i
RefSpatialBench

64.3%

i
MCP Atlas

62.8%

i
WideSearch

60.1%

i
SimpleVQA

58.9%

i
ZClawBench

52.6%

i
Terminal-Bench 2.0

51.5%

i
ODinW

50.8%

i
Claw-Eval

50.0%

i
SWE-Bench Pro

49.5%

i
MCP-Mark

37.0%

i
VITA-Bench

35.6%

i
ZEROBench-Sub

34.4%

i
NL2Repo

29.4%

i
SkillsBench

28.7%

i
Toolathlon

26.9%

i
DeepPlanning

25.9%

i
Humanity's Last Exam

21.4%

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
AkashML	available	$0.14/Mtok	$1.00/Mtok	262K tokens context 262K tokens max output	99.9% 5m 99.9%	761 ms p50 TTFT 88 tok/s p50	fp8
Parasail	available	$0.15/Mtok cache $0.05/Mtok	$1.00/Mtok	262K tokens context 262K tokens max output	99.9% 5m 100.0%	576 ms p50 TTFT 82 tok/s p50	fp8
AtlasCloud	available	$0.19/Mtok cache $0.19/Mtok	$1.11/Mtok	262K tokens context 66K tokens max output	99.8% 5m 100.0%	1,229 ms p50 TTFT 96 tok/s p50	fp8
SiliconFlow	available	$0.20/Mtok	$1.60/Mtok	262K tokens context 262K tokens max output	99.5% 5m 100.0%	1,290 ms p50 TTFT 40 tok/s p50	fp8
WandB	available	$0.25/Mtok cache $0.25/Mtok	$1.25/Mtok	262K tokens context 262K tokens max output	99.9% 5m 100.0%	336 ms p50 TTFT 160 tok/s p50	fp8
DekaLLM	-2	$0.13/Mtok	$1.00/Mtok	262K tokens context 262K tokens max output	94% 5m 97%	704 ms p50 TTFT 59 tok/s p50