GLM-5

GLM-5 is Zhipu AI's flagship foundation model, designed for complex systems engineering and long-horizon agent tasks, with its emphasis shifting from code generation to broader engineering work. It uses a Mixture-of-Experts (MoE) architecture with 744B total parameters, of which 40B are activated per token, and was trained on 28.5T tokens. GLM-5 integrates DeepSeek Sparse Attention for higher token efficiency while preserving long-context quality. It supports a 200K-token context length and up to 128K output tokens, with capabilities including thinking modes, real-time streaming, function calling, context caching, and structured output. GLM-5 is described as approaching Claude Opus 4.5 in code-logic density and systems-engineering capability.
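To put the parameter and token counts above in proportion, the arithmetic can be sketched as follows. This is a minimal illustration that uses only the figures quoted in the description (744B total, 40B activated, 28.5T training tokens). The constant and function names are hypothetical, and the ratios are rough indicators of scale, not official metrics.

```python
# Figures taken from the model description above.
TOTAL_PARAMS_B = 744.0    # total parameters, in billions
ACTIVE_PARAMS_B = 40.0    # parameters activated per token, in billions
TRAIN_TOKENS_T = 28.5     # training tokens, in trillions


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters that is activated for each token."""
    return active_b / total_b


def tokens_per_param(tokens_t: float, params_b: float) -> float:
    """Training tokens per parameter (trillions of tokens / billions of params)."""
    return (tokens_t * 1e12) / (params_b * 1e9)


frac = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
per_total = tokens_per_param(TRAIN_TOKENS_T, TOTAL_PARAMS_B)
per_active = tokens_per_param(TRAIN_TOKENS_T, ACTIVE_PARAMS_B)

print(f"Activated fraction: {frac:.1%}")
print(f"Tokens per total parameter: {per_total:.1f}")
print(f"Tokens per activated parameter: {per_active:.1f}")
```

Under these figures, roughly one parameter in nineteen is activated for any given token, which is the usual motivation for an MoE design: per-token compute tracks the activated count and not the total.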

Benchmark results

Benchmark            Score   Tags           Source
BrowseComp           75.9%   self-reported  llm-stats
MCP Atlas            67.8%   self-reported  llm-stats
SWE-Bench Verified   77.8%   self-reported  llm-stats
t2-bench             89.7%   self-reported  llm-stats
Terminal-Bench 2.0   56.2%   self-reported  llm-stats