Qwen3 VL 235B A22B Thinking

Qwen3-VL-235B-A22B-Thinking is the most powerful vision-language model in the Qwen series, featuring 236B parameters with MoE architecture for reasoning-enhanced multimodal understanding. Key capabilities include: Visual Agent (operates PC/mobile GUIs, recognizes elements, invokes tools), Visual Coding (generates Draw.io/HTML/CSS/JS from images/videos), Advanced Spatial Perception (2D grounding and 3D grounding for spatial reasoning and embodied AI), Long Context & Video Understanding (native 256K context expandable to 1M, handles hours-long video with second-level indexing), Enhanced Multimodal Reasoning (excels in STEM/Math with causal analysis), Upgraded Visual Recognition (celebrities, anime, products, landmarks, flora/fauna), and Expanded OCR (32 languages, robust in low light/blur/tilt).

ZebraLogic

97.3%

i
DocVQAtest

96.5%

i
ScreenSpot

95.4%

i
CountBench

93.7%

i
MMLU-Redux

93.7%

i
Design2Code

93.4%

i
MIABench

92.7%

i
RefCOCO-avg

92.4%

i
MMBench-V1.1

90.6%

i
MMLU

90.6%

i
AIME 2025

89.7%

i
InfoVQAtest

89.5%

i
AI2D

89.2%

i
IFEval

88.2%

i
OCRBench

87.5%

i
WritingBench

86.7%

i
MathVista-Mini

85.8%

i
MathVerse-Mini

85.0%

i
EmbSpatialBench

84.3%

i
MLVU

83.8%

i
MMLU-Pro

83.8%

i
CC-OCR

81.5%

i
RealWorldQA

81.3%

i
MMLU-ProX

80.6%

i
MMMUval

80.6%

i
MuirBench

80.1%

i
Include

80.0%

i
VideoMMMU

80.0%

i
LiveBench 20241125

79.6%

i
Multi-IF

79.1%

i
VideoMME w/o sub.

79.0%

i
MMStar

78.7%

i
HMMT25

77.4%

i
SIFO

77.3%

i
MathVision

74.6%

i
RoboSpatialHome

73.9%

i
BFCL-v3

71.9%

i
Objectron

71.2%

i
SIFO-Multiturn

71.1%

i
LiveCodeBench v6

70.1%

i
RefSpatialBench

69.9%

i
MMMU-Pro

69.3%

i
OSWorld-G

68.3%

i
BLINK

67.1%

i
OCRBench-V2 (en)

66.8%

i
Hallusion Bench

66.7%

i
CharXiv-R

66.1%

i
SuperGPQA

64.3%

i
LVBench

63.6%

i
CharadesSTA

63.5%

i
OCRBench-V2 (zh)

63.5%

i
ScreenSpot Pro

61.8%

i
SimpleVQA

61.3%

i
MMLongBench-Doc

56.2%

i
ARKitScenes

53.7%

i
ERQA

52.5%

i
SimpleQA

44.4%

i
ODinW

43.2%

i
OSWorld

38.1%

i
SUNRGBD

34.9%

i
VisuLogic

34.4%

i
ZEROBench-Sub

27.7%

i
Humanity's Last Exam

13.6%

i
Hypersim

11.0%

i
ZEROBench

4.0%

i
MM-MT-Bench

8.5

i
Creative Writing v3

0.857

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Alibaba	available	$0.26/Mtok	$2.60/Mtok	131K tokens context 33K tokens max output	—	937 ms p50 TTFT 53 tok/s p50
Novita	available	$0.98/Mtok	$3.95/Mtok	131K tokens context 33K tokens max output	—	910 ms p50 TTFT 40 tok/s p50	bf16