Qwen3.5-35B-A3B

Qwen3.5-35B-A3B is a multimodal Mixture-of-Experts model with 35 billion total parameters and 3 billion activated parameters. It combines strong reasoning, coding, agentic, and visual understanding performance with production-friendly efficiency and a native 262K context window.

Benchmark results

Benchmark Score Tags Source
AA-LCR 58.5% self-reported llm-stats link →
AI2D 92.6% self-reported llm-stats link →
AndroidWorld_SR 71.1% self-reported llm-stats link →
BabyVision 38.4% self-reported llm-stats link →
BFCL-V4 67.3% self-reported llm-stats link →
BrowseComp 61.0% self-reported llm-stats link →
BrowseComp-zh 69.5% self-reported llm-stats link →
C-Eval 90.2% self-reported llm-stats link →
CC-OCR 80.7% self-reported llm-stats link →
CharXiv-R 77.5% self-reported llm-stats link →
CodeForces 82.2% self-reported llm-stats link →
CountBench 97.8% self-reported llm-stats link →
DeepPlanning 22.8% self-reported llm-stats link →
DynaMath 85.0% self-reported llm-stats link →
EmbSpatialBench 83.1% self-reported llm-stats link →
ERQA 64.8% self-reported llm-stats link →
FullStackBench en 58.1% self-reported llm-stats link →
FullStackBench zh 55.0% self-reported llm-stats link →
Global PIQA 86.6% self-reported llm-stats link →
GPQA 84.2% self-reported llm-stats link →
Hallusion Bench 67.9% self-reported llm-stats link →
HMMT 2025 89.0% self-reported llm-stats link →
HMMT25 89.2% self-reported llm-stats link →
Humanity's Last Exam 47.4% self-reported llm-stats link →
Hypersim 13.1% self-reported llm-stats link →
IFBench 70.2% self-reported llm-stats link →
IFEval 91.9% self-reported llm-stats link →
Include 79.7% self-reported llm-stats link →
LingoQA 79.2% self-reported llm-stats link →
LiveCodeBench v6 74.6% self-reported llm-stats link →
LongBench v2 59.0% self-reported llm-stats link →
LVBench 71.4% self-reported llm-stats link →
MathVision 83.9% self-reported llm-stats link →
MathVista-Mini 86.2% self-reported llm-stats link →
MAXIFE 86.6% self-reported llm-stats link →
MedXpertQA 61.4% self-reported llm-stats link →
MLVU 85.6% self-reported llm-stats link →
MMBench-V1.1 91.5% self-reported llm-stats link →
MMLongBench-Doc 59.5% self-reported llm-stats link →
MMLU-Pro 85.3% self-reported llm-stats link →
MMLU-ProX 81.0% self-reported llm-stats link →
MMLU-Redux 93.3% self-reported llm-stats link →
MMMLU 85.2% self-reported llm-stats link →
MMMU 81.4% self-reported llm-stats link →
MMMU-Pro 75.1% self-reported llm-stats link →
MMStar 81.9% self-reported llm-stats link →
MMVU 72.3% self-reported llm-stats link →
Multi-Challenge 60.0% self-reported llm-stats link →
MVBench 74.8% self-reported llm-stats link →
NOVA-63 57.1% self-reported llm-stats link →
Nuscene 14.6% self-reported llm-stats link →
OCRBench 91.0% self-reported llm-stats link →
ODinW 42.6% self-reported llm-stats link →
OJBench 36.0% self-reported llm-stats link →
OmniDocBench 1.5 89.3% self-reported llm-stats link →
OSWorld-Verified 54.5% self-reported llm-stats link →
PMC-VQA 62.0% self-reported llm-stats link →
PolyMATH 64.4% self-reported llm-stats link →
RealWorldQA 84.1% self-reported llm-stats link →
RefCOCO-avg 89.2% self-reported llm-stats link →
RefSpatialBench 63.5% self-reported llm-stats link →
ScreenSpot Pro 68.6% self-reported llm-stats link →
Seal-0 41.4% self-reported llm-stats link →
SimpleVQA 58.3% self-reported llm-stats link →
SlakeVQA 78.7% self-reported llm-stats link →
SUNRGBD 33.4% self-reported llm-stats link →
SuperGPQA 63.4% self-reported llm-stats link →
SWE-Bench Verified 69.2% self-reported llm-stats link →
t2-bench 81.2% self-reported llm-stats link →
Terminal-Bench 2.0 40.5% self-reported llm-stats link →
TIR-Bench 55.5% self-reported llm-stats link →
V* 92.7% self-reported llm-stats link →
VideoMME w sub. 86.6% self-reported llm-stats link →
VideoMME w/o sub. 82.5% self-reported llm-stats link →
VideoMMMU 80.4% self-reported llm-stats link →
VITA-Bench 31.9% self-reported llm-stats link →
VLMsAreBlind 97.0% self-reported llm-stats link →
WideSearch 57.1% self-reported llm-stats link →
WMT24++ 76.3% self-reported llm-stats link →
ZEROBench 8.0% self-reported llm-stats link →
ZEROBench-Sub 34.1% self-reported llm-stats link →