Qwen3.5-35B-A3B
Qwen3.5-35B-A3B is a multimodal Mixture-of-Experts model with 35 billion total parameters and 3 billion activated parameters. It combines strong reasoning, coding, agentic, and visual understanding performance with production-friendly efficiency and a native 262K context window.
Benchmark results
| Benchmark | Score | Tags | Source |
|---|---|---|---|
| AA-LCR | 58.5% | self-reported llm-stats | link → |
| AI2D | 92.6% | self-reported llm-stats | link → |
| AndroidWorld_SR | 71.1% | self-reported llm-stats | link → |
| BabyVision | 38.4% | self-reported llm-stats | link → |
| BFCL-V4 | 67.3% | self-reported llm-stats | link → |
| BrowseComp | 61.0% | self-reported llm-stats | link → |
| BrowseComp-zh | 69.5% | self-reported llm-stats | link → |
| C-Eval | 90.2% | self-reported llm-stats | link → |
| CC-OCR | 80.7% | self-reported llm-stats | link → |
| CharXiv-R | 77.5% | self-reported llm-stats | link → |
| CodeForces | 82.2% | self-reported llm-stats | link → |
| CountBench | 97.8% | self-reported llm-stats | link → |
| DeepPlanning | 22.8% | self-reported llm-stats | link → |
| DynaMath | 85.0% | self-reported llm-stats | link → |
| EmbSpatialBench | 83.1% | self-reported llm-stats | link → |
| ERQA | 64.8% | self-reported llm-stats | link → |
| FullStackBench en | 58.1% | self-reported llm-stats | link → |
| FullStackBench zh | 55.0% | self-reported llm-stats | link → |
| Global PIQA | 86.6% | self-reported llm-stats | link → |
| GPQA | 84.2% | self-reported llm-stats | link → |
| Hallusion Bench | 67.9% | self-reported llm-stats | link → |
| HMMT 2025 | 89.0% | self-reported llm-stats | link → |
| HMMT25 | 89.2% | self-reported llm-stats | link → |
| Humanity's Last Exam | 47.4% | self-reported llm-stats | link → |
| Hypersim | 13.1% | self-reported llm-stats | link → |
| IFBench | 70.2% | self-reported llm-stats | link → |
| IFEval | 91.9% | self-reported llm-stats | link → |
| Include | 79.7% | self-reported llm-stats | link → |
| LingoQA | 79.2% | self-reported llm-stats | link → |
| LiveCodeBench v6 | 74.6% | self-reported llm-stats | link → |
| LongBench v2 | 59.0% | self-reported llm-stats | link → |
| LVBench | 71.4% | self-reported llm-stats | link → |
| MathVision | 83.9% | self-reported llm-stats | link → |
| MathVista-Mini | 86.2% | self-reported llm-stats | link → |
| MAXIFE | 86.6% | self-reported llm-stats | link → |
| MedXpertQA | 61.4% | self-reported llm-stats | link → |
| MLVU | 85.6% | self-reported llm-stats | link → |
| MMBench-V1.1 | 91.5% | self-reported llm-stats | link → |
| MMLongBench-Doc | 59.5% | self-reported llm-stats | link → |
| MMLU-Pro | 85.3% | self-reported llm-stats | link → |
| MMLU-ProX | 81.0% | self-reported llm-stats | link → |
| MMLU-Redux | 93.3% | self-reported llm-stats | link → |
| MMMLU | 85.2% | self-reported llm-stats | link → |
| MMMU | 81.4% | self-reported llm-stats | link → |
| MMMU-Pro | 75.1% | self-reported llm-stats | link → |
| MMStar | 81.9% | self-reported llm-stats | link → |
| MMVU | 72.3% | self-reported llm-stats | link → |
| Multi-Challenge | 60.0% | self-reported llm-stats | link → |
| MVBench | 74.8% | self-reported llm-stats | link → |
| NOVA-63 | 57.1% | self-reported llm-stats | link → |
| Nuscene | 14.6% | self-reported llm-stats | link → |
| OCRBench | 91.0% | self-reported llm-stats | link → |
| ODinW | 42.6% | self-reported llm-stats | link → |
| OJBench | 36.0% | self-reported llm-stats | link → |
| OmniDocBench 1.5 | 89.3% | self-reported llm-stats | link → |
| OSWorld-Verified | 54.5% | self-reported llm-stats | link → |
| PMC-VQA | 62.0% | self-reported llm-stats | link → |
| PolyMATH | 64.4% | self-reported llm-stats | link → |
| RealWorldQA | 84.1% | self-reported llm-stats | link → |
| RefCOCO-avg | 89.2% | self-reported llm-stats | link → |
| RefSpatialBench | 63.5% | self-reported llm-stats | link → |
| ScreenSpot Pro | 68.6% | self-reported llm-stats | link → |
| Seal-0 | 41.4% | self-reported llm-stats | link → |
| SimpleVQA | 58.3% | self-reported llm-stats | link → |
| SlakeVQA | 78.7% | self-reported llm-stats | link → |
| SUNRGBD | 33.4% | self-reported llm-stats | link → |
| SuperGPQA | 63.4% | self-reported llm-stats | link → |
| SWE-Bench Verified | 69.2% | self-reported llm-stats | link → |
| t2-bench | 81.2% | self-reported llm-stats | link → |
| Terminal-Bench 2.0 | 40.5% | self-reported llm-stats | link → |
| TIR-Bench | 55.5% | self-reported llm-stats | link → |
| V* | 92.7% | self-reported llm-stats | link → |
| VideoMME w sub. | 86.6% | self-reported llm-stats | link → |
| VideoMME w/o sub. | 82.5% | self-reported llm-stats | link → |
| VideoMMMU | 80.4% | self-reported llm-stats | link → |
| VITA-Bench | 31.9% | self-reported llm-stats | link → |
| VLMsAreBlind | 97.0% | self-reported llm-stats | link → |
| WideSearch | 57.1% | self-reported llm-stats | link → |
| WMT24++ | 76.3% | self-reported llm-stats | link → |
| ZEROBench | 8.0% | self-reported llm-stats | link → |
| ZEROBench-Sub | 34.1% | self-reported llm-stats | link → |