Qwen3.5-122B-A10B

Qwen3.5-122B-A10B is a multimodal Mixture-of-Experts model with 122 billion total parameters and 10 billion activated parameters. It combines strong reasoning, coding, long-context, and visual understanding performance with production-friendly efficiency and a native 262K context window.
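As a rough illustration of what the 122B total / 10B activated split implies, the sketch below computes the share of parameters used per token and a back-of-envelope weight footprint. It assumes the activated figure is counted per token, as is usual for Mixture-of-Experts models, and that weights are stored at 2 bytes per parameter (for example BF16). The 2-byte figure and the helper names are illustrative assumptions, not published details of this model.

```python
# Parameter counts taken from the model description above.
TOTAL_PARAMS = 122e9   # all experts plus shared layers
ACTIVE_PARAMS = 10e9   # parameters activated for a single token


def active_fraction(total: float, active: float) -> float:
    """Return the share of parameters that take part in one forward pass."""
    if total <= 0 or not 0 <= active <= total:
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active / total


def weight_memory_gb(total: float, bytes_per_param: float) -> float:
    """Rough weight footprint in decimal GB; ignores KV cache and activations."""
    return total * bytes_per_param / 1e9


fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
# Assumption: 2 bytes per parameter (e.g. BF16); not stated in the source.
memory_gb = weight_memory_gb(TOTAL_PARAMS, bytes_per_param=2)

print(f"Active share per token: {fraction:.1%}")
print(f"Approximate weight memory at 2 bytes/param: {memory_gb:.0f} GB")
```

Under these assumptions, per-token compute scales with the activated parameters, while memory for the weights still scales with the total, since every expert has to be resident to be routable.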

Benchmark results

Benchmark             Score   Tags           Source
AA-LCR                66.9%   self-reported  llm-stats
AI2D                  93.3%   self-reported  llm-stats
AndroidWorld_SR       66.4%   self-reported  llm-stats
BabyVision            40.2%   self-reported  llm-stats
BFCL-V4               72.2%   self-reported  llm-stats
BrowseComp            63.8%   self-reported  llm-stats
BrowseComp-zh         69.9%   self-reported  llm-stats
C-Eval                91.9%   self-reported  llm-stats
CC-OCR                81.8%   self-reported  llm-stats
CharXiv-R             77.2%   self-reported  llm-stats
CodeForces            85.1%   self-reported  llm-stats
CountBench            97.0%   self-reported  llm-stats
DeepPlanning          24.1%   self-reported  llm-stats
DynaMath              85.9%   self-reported  llm-stats
EmbSpatialBench       83.9%   self-reported  llm-stats
ERQA                  62.0%   self-reported  llm-stats
FullStackBench en     62.6%   self-reported  llm-stats
FullStackBench zh     58.7%   self-reported  llm-stats
Global PIQA           88.4%   self-reported  llm-stats
GPQA                  86.6%   self-reported  llm-stats
Hallusion Bench       67.6%   self-reported  llm-stats
HMMT 2025             91.4%   self-reported  llm-stats
HMMT25                90.3%   self-reported  llm-stats
Humanity's Last Exam  47.5%   self-reported  llm-stats
Hypersim              12.7%   self-reported  llm-stats
IFBench               76.1%   self-reported  llm-stats
IFEval                93.4%   self-reported  llm-stats
Include               82.8%   self-reported  llm-stats
LingoQA               80.8%   self-reported  llm-stats
LiveCodeBench v6      78.9%   self-reported  llm-stats
LongBench v2          60.2%   self-reported  llm-stats
LVBench               74.4%   self-reported  llm-stats
MathVision            86.2%   self-reported  llm-stats
MathVista-Mini        87.4%   self-reported  llm-stats
MAXIFE                87.9%   self-reported  llm-stats
MedXpertQA            67.3%   self-reported  llm-stats
MLVU                  87.3%   self-reported  llm-stats
MMBench-V1.1          92.8%   self-reported  llm-stats
MMLongBench-Doc       59.0%   self-reported  llm-stats
MMLU-Pro              86.7%   self-reported  llm-stats
MMLU-ProX             82.2%   self-reported  llm-stats
MMLU-Redux            94.0%   self-reported  llm-stats
MMMLU                 86.7%   self-reported  llm-stats
MMMU                  83.9%   self-reported  llm-stats
MMMU-Pro              76.9%   self-reported  llm-stats
MMStar                82.9%   self-reported  llm-stats
MMVU                  74.7%   self-reported  llm-stats
Multi-Challenge       61.5%   self-reported  llm-stats
MVBench               76.6%   self-reported  llm-stats
NOVA-63               58.6%   self-reported  llm-stats
Nuscene               15.4%   self-reported  llm-stats
OCRBench              92.1%   self-reported  llm-stats
ODinW                 44.5%   self-reported  llm-stats
OJBench               39.5%   self-reported  llm-stats
OmniDocBench 1.5      89.8%   self-reported  llm-stats
OSWorld-Verified      58.0%   self-reported  llm-stats
PMC-VQA               63.3%   self-reported  llm-stats
PolyMATH              68.9%   self-reported  llm-stats
RealWorldQA           85.1%   self-reported  llm-stats
RefCOCO-avg           91.3%   self-reported  llm-stats
RefSpatialBench       69.3%   self-reported  llm-stats
ScreenSpot Pro        70.4%   self-reported  llm-stats
Seal-0                44.1%   self-reported  llm-stats
SimpleVQA             61.7%   self-reported  llm-stats
SlakeVQA              81.6%   self-reported  llm-stats
SUNRGBD               36.2%   self-reported  llm-stats
SuperGPQA             67.1%   self-reported  llm-stats
SWE-Bench Verified    72.0%   self-reported  llm-stats
t2-bench              79.5%   self-reported  llm-stats
Terminal-Bench 2.0    49.4%   self-reported  llm-stats
TIR-Bench             53.2%   self-reported  llm-stats
V*                    93.2%   self-reported  llm-stats
VideoMME w sub.       87.3%   self-reported  llm-stats
VideoMME w/o sub.     83.9%   self-reported  llm-stats
VideoMMMU             82.0%   self-reported  llm-stats
VITA-Bench            33.6%   self-reported  llm-stats
VLMsAreBlind          96.7%   self-reported  llm-stats
WideSearch            60.5%   self-reported  llm-stats
WMT24++               78.3%   self-reported  llm-stats
ZEROBench             9.0%    self-reported  llm-stats
ZEROBench-Sub         36.2%   self-reported  llm-stats