GLM-5V-Turbo

GLM-5V-Turbo is Z.AI's first multimodal coding foundation model, built for vision-based coding tasks. It natively processes multimodal inputs including images, video, text, and files, while excelling at long-horizon planning, complex coding, and action execution.

Design2Code

94.8%

i
Flame-VLM-Code

93.8%

i
V*

89.0%

i
WebVoyager

88.5%

i
PinchBench

80.7%

i
SimpleVQA

78.2%

i
AndroidWorld

75.7%

i
Claw-Eval

75.0%

i
MMSearch

72.9%

i
CC-Bench-V2 Repo Exploration

72.2%

i
CC-Bench-V2 Frontend

68.4%

i
OSWorld

62.3%

i
FACTS Grounding

58.6%

i
ZClawBench

57.6%

i
BrowseComp-VL

51.9%

i
Vision2Web

31.0%

i
ImageMining

30.7%

i
MMSearch-Plus

30.0%

i
CC-Bench-V2 Backend

22.8%

i

Pricing, uptime, and speed via OpenRouter — updated Jul 17, 2026, 04:19 AM.

Provider	Status	Input	Output	Limits	Uptime	Speed	Notes
Z.AI	available	$1.20/Mtok cache $0.24/Mtok	$4.00/Mtok	203K tokens context 131K tokens max output	100.0% 5m 100.0%	4,477 ms p50 TTFT 33 tok/s p50	fp8