Qwen3.5-122B-A10B

Qwen3.5-122B-A10B is a multimodal Mixture-of-Experts model with 122 billion total parameters, of which 10 billion are activated per token. It combines strong reasoning, coding, long-context, and visual understanding performance with production-friendly efficiency and a native 262K-token context window.

Benchmark	Score
CountBench	97.0%
VLMsAreBlind	96.7%
MMLU-Redux	94.0%
IFEval	93.4%
AI2D	93.3%
V*	93.2%
MMBench-V1.1	92.8%
OCRBench	92.1%
C-Eval	91.9%
HMMT 2025	91.4%
RefCOCO-avg	91.3%
HMMT25	90.3%
OmniDocBench 1.5	89.8%
Global PIQA	88.4%
MAXIFE	87.9%
MathVista-Mini	87.4%
MLVU	87.3%
VideoMME w sub.	87.3%
MMLU-Pro	86.7%
MMMLU	86.7%
GPQA	86.6%
MathVision	86.2%
DynaMath	85.9%
CodeForces	85.1%
RealWorldQA	85.1%
EmbSpatialBench	83.9%
MMMU	83.9%
VideoMME w/o sub.	83.9%
MMStar	82.9%
Include	82.8%
MMLU-ProX	82.2%
VideoMMMU	82.0%
CC-OCR	81.8%
SlakeVQA	81.6%
LingoQA	80.8%
t2-bench	79.5%
LiveCodeBench v6	78.9%
WMT24++	78.3%
CharXiv-R	77.2%
MMMU-Pro	76.9%
MVBench	76.6%
IFBench	76.1%
MMVU	74.7%
LVBench	74.4%
BFCL-V4	72.2%
SWE-Bench Verified	72.0%
ScreenSpot Pro	70.4%
BrowseComp-zh	69.9%
RefSpatialBench	69.3%
PolyMATH	68.9%
Hallusion Bench	67.6%
MedXpertQA	67.3%
SuperGPQA	67.1%
AA-LCR	66.9%
AndroidWorld_SR	66.4%
BrowseComp	63.8%
PMC-VQA	63.3%
FullStackBench en	62.6%
ERQA	62.0%
SimpleVQA	61.7%
Multi-Challenge	61.5%
WideSearch	60.5%
LongBench v2	60.2%
MMLongBench-Doc	59.0%
FullStackBench zh	58.7%
NOVA-63	58.6%
OSWorld-Verified	58.0%
TIR-Bench	53.2%
Terminal-Bench 2.0	49.4%
Humanity's Last Exam	47.5%
ODinW	44.5%
Seal-0	44.1%
BabyVision	40.2%
OJBench	39.5%
SUNRGBD	36.2%
ZEROBench-Sub	36.2%
VITA-Bench	33.6%
DeepPlanning	24.1%
Nuscene	15.4%
Hypersim	12.7%
ZEROBench	9.0%

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Alibaba	available	$0.26/Mtok	$2.08/Mtok	262K context, 66K max output	100.0% (5m: 100.0%)	987 ms p50 TTFT, 82 tok/s p50	
SiliconFlow	available	$0.26/Mtok	$2.08/Mtok	262K context, 262K max output	100.0%	1,348 ms p50 TTFT, 3.0 tok/s p50	fp8
DeepInfra	available	$0.29/Mtok	$2.40/Mtok	262K context, 82K max output	100.0%	708 ms p50 TTFT, 53 tok/s p50	fp4
AtlasCloud	available	$0.30/Mtok (cache: $0.30/Mtok)	$2.40/Mtok	262K context, 66K max output	100.0% (5m: 100.0%)	1,590 ms p50 TTFT, 83 tok/s p50	fp8
Novita	available	$0.40/Mtok	$3.20/Mtok	262K context, 66K max output	n/a	994 ms p50 TTFT, 78 tok/s p50	bf16
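
The per-provider figures above can be turned into a rough per-request estimate. The Python sketch below is illustrative only. It assumes that "Mtok" means one million tokens, that input and output tokens are billed linearly at the listed rates (cache pricing is ignored), and that the p50 TTFT and throughput figures describe a typical request, which they may not, since they are medians and not guarantees. The names `Provider`, `request_cost_usd`, and `estimated_latency_s` are hypothetical helpers, not part of any provider's API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Provider:
    name: str
    input_usd_per_mtok: float   # listed input price, USD per million tokens (assumed unit)
    output_usd_per_mtok: float  # listed output price, USD per million tokens (assumed unit)
    ttft_ms_p50: float          # median time to first token, in milliseconds
    tokens_per_s_p50: float     # median output throughput, in tokens per second


# Values copied from the provider table above.
PROVIDERS = [
    Provider("Alibaba", 0.26, 2.08, 987, 82),
    Provider("SiliconFlow", 0.26, 2.08, 1348, 3.0),
    Provider("DeepInfra", 0.29, 2.40, 708, 53),
    Provider("AtlasCloud", 0.30, 2.40, 1590, 83),
    Provider("Novita", 0.40, 3.20, 994, 78),
]


def request_cost_usd(p: Provider, input_tokens: int, output_tokens: int) -> float:
    """Linear cost estimate; ignores cache discounts and any per-request fees."""
    total = input_tokens * p.input_usd_per_mtok + output_tokens * p.output_usd_per_mtok
    return total / 1_000_000


def estimated_latency_s(p: Provider, output_tokens: int) -> float:
    """Median-based estimate: time to first token plus steady-state generation time."""
    return p.ttft_ms_p50 / 1000 + output_tokens / p.tokens_per_s_p50


if __name__ == "__main__":
    # Hypothetical workload: 20,000 prompt tokens and 2,000 generated tokens.
    for p in PROVIDERS:
        cost = request_cost_usd(p, 20_000, 2_000)
        latency = estimated_latency_s(p, 2_000)
        print(f"{p.name:<12} ${cost:.5f}  ~{latency:.1f} s")
```

Because output tokens cost roughly eight times as much as input tokens at every listed provider, generation-heavy workloads (long reasoning traces, large code outputs) are likely to dominate the bill even when prompts are long.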