Qwen3.5-122B-A10B
Qwen3.5-122B-A10B is a multimodal Mixture-of-Experts model with 122 billion total parameters and 10 billion activated parameters. It combines strong reasoning, coding, long-context, and visual understanding performance with production-friendly efficiency and a native 262K context window.
Benchmark results
| Benchmark | Score | Tags | Source |
|---|---|---|---|
| AA-LCR | 66.9% | self-reported llm-stats | link → |
| AI2D | 93.3% | self-reported llm-stats | link → |
| AndroidWorld_SR | 66.4% | self-reported llm-stats | link → |
| BabyVision | 40.2% | self-reported llm-stats | link → |
| BFCL-V4 | 72.2% | self-reported llm-stats | link → |
| BrowseComp | 63.8% | self-reported llm-stats | link → |
| BrowseComp-zh | 69.9% | self-reported llm-stats | link → |
| C-Eval | 91.9% | self-reported llm-stats | link → |
| CC-OCR | 81.8% | self-reported llm-stats | link → |
| CharXiv-R | 77.2% | self-reported llm-stats | link → |
| CodeForces | 85.1% | self-reported llm-stats | link → |
| CountBench | 97.0% | self-reported llm-stats | link → |
| DeepPlanning | 24.1% | self-reported llm-stats | link → |
| DynaMath | 85.9% | self-reported llm-stats | link → |
| EmbSpatialBench | 83.9% | self-reported llm-stats | link → |
| ERQA | 62.0% | self-reported llm-stats | link → |
| FullStackBench en | 62.6% | self-reported llm-stats | link → |
| FullStackBench zh | 58.7% | self-reported llm-stats | link → |
| Global PIQA | 88.4% | self-reported llm-stats | link → |
| GPQA | 86.6% | self-reported llm-stats | link → |
| Hallusion Bench | 67.6% | self-reported llm-stats | link → |
| HMMT 2025 | 91.4% | self-reported llm-stats | link → |
| HMMT25 | 90.3% | self-reported llm-stats | link → |
| Humanity's Last Exam | 47.5% | self-reported llm-stats | link → |
| Hypersim | 12.7% | self-reported llm-stats | link → |
| IFBench | 76.1% | self-reported llm-stats | link → |
| IFEval | 93.4% | self-reported llm-stats | link → |
| Include | 82.8% | self-reported llm-stats | link → |
| LingoQA | 80.8% | self-reported llm-stats | link → |
| LiveCodeBench v6 | 78.9% | self-reported llm-stats | link → |
| LongBench v2 | 60.2% | self-reported llm-stats | link → |
| LVBench | 74.4% | self-reported llm-stats | link → |
| MathVision | 86.2% | self-reported llm-stats | link → |
| MathVista-Mini | 87.4% | self-reported llm-stats | link → |
| MAXIFE | 87.9% | self-reported llm-stats | link → |
| MedXpertQA | 67.3% | self-reported llm-stats | link → |
| MLVU | 87.3% | self-reported llm-stats | link → |
| MMBench-V1.1 | 92.8% | self-reported llm-stats | link → |
| MMLongBench-Doc | 59.0% | self-reported llm-stats | link → |
| MMLU-Pro | 86.7% | self-reported llm-stats | link → |
| MMLU-ProX | 82.2% | self-reported llm-stats | link → |
| MMLU-Redux | 94.0% | self-reported llm-stats | link → |
| MMMLU | 86.7% | self-reported llm-stats | link → |
| MMMU | 83.9% | self-reported llm-stats | link → |
| MMMU-Pro | 76.9% | self-reported llm-stats | link → |
| MMStar | 82.9% | self-reported llm-stats | link → |
| MMVU | 74.7% | self-reported llm-stats | link → |
| Multi-Challenge | 61.5% | self-reported llm-stats | link → |
| MVBench | 76.6% | self-reported llm-stats | link → |
| NOVA-63 | 58.6% | self-reported llm-stats | link → |
| Nuscene | 15.4% | self-reported llm-stats | link → |
| OCRBench | 92.1% | self-reported llm-stats | link → |
| ODinW | 44.5% | self-reported llm-stats | link → |
| OJBench | 39.5% | self-reported llm-stats | link → |
| OmniDocBench 1.5 | 89.8% | self-reported llm-stats | link → |
| OSWorld-Verified | 58.0% | self-reported llm-stats | link → |
| PMC-VQA | 63.3% | self-reported llm-stats | link → |
| PolyMATH | 68.9% | self-reported llm-stats | link → |
| RealWorldQA | 85.1% | self-reported llm-stats | link → |
| RefCOCO-avg | 91.3% | self-reported llm-stats | link → |
| RefSpatialBench | 69.3% | self-reported llm-stats | link → |
| ScreenSpot Pro | 70.4% | self-reported llm-stats | link → |
| Seal-0 | 44.1% | self-reported llm-stats | link → |
| SimpleVQA | 61.7% | self-reported llm-stats | link → |
| SlakeVQA | 81.6% | self-reported llm-stats | link → |
| SUNRGBD | 36.2% | self-reported llm-stats | link → |
| SuperGPQA | 67.1% | self-reported llm-stats | link → |
| SWE-Bench Verified | 72.0% | self-reported llm-stats | link → |
| t2-bench | 79.5% | self-reported llm-stats | link → |
| Terminal-Bench 2.0 | 49.4% | self-reported llm-stats | link → |
| TIR-Bench | 53.2% | self-reported llm-stats | link → |
| V* | 93.2% | self-reported llm-stats | link → |
| VideoMME w sub. | 87.3% | self-reported llm-stats | link → |
| VideoMME w/o sub. | 83.9% | self-reported llm-stats | link → |
| VideoMMMU | 82.0% | self-reported llm-stats | link → |
| VITA-Bench | 33.6% | self-reported llm-stats | link → |
| VLMsAreBlind | 96.7% | self-reported llm-stats | link → |
| WideSearch | 60.5% | self-reported llm-stats | link → |
| WMT24++ | 78.3% | self-reported llm-stats | link → |
| ZEROBench | 9.0% | self-reported llm-stats | link → |
| ZEROBench-Sub | 36.2% | self-reported llm-stats | link → |