GLM-5
GLM-5 is Zhipu AI's flagship foundation model, designed for complex systems engineering and long-horizon agent tasks, marking a shift in focus from coding to engineering. It uses a Mixture-of-Experts architecture with 744B total parameters, of which 40B are activated, and was trained on 28.5T tokens. GLM-5 integrates DeepSeek Sparse Attention to improve token efficiency while preserving long-context quality. It supports a 200K-token context length and up to 128K output tokens, and its capabilities include thinking modes, real-time streaming, function calling, context caching, and structured output. GLM-5 is described as approaching Claude Opus 4.5 in code-logic density and systems-engineering capability.
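To make the figures above concrete, here is a minimal sketch that computes the share of parameters activated and the output budget left for a given prompt size. The helper names are hypothetical. The code also assumes that "200K" and "128K" mean exactly 200,000 and 128,000 tokens and that prompt and output share one context window, neither of which is stated here.

```python
# Figures taken from the model description; exact token counts are assumptions.
TOTAL_PARAMS_B = 744      # total parameters, in billions
ACTIVE_PARAMS_B = 40      # activated parameters, in billions
CONTEXT_WINDOW = 200_000  # assumed: "200K" means exactly 200,000 tokens
MAX_OUTPUT = 128_000      # assumed: "128K" means exactly 128,000 tokens


def active_fraction(total_b: float = TOTAL_PARAMS_B,
                    active_b: float = ACTIVE_PARAMS_B) -> float:
    """Share of the model's parameters that are activated."""
    return active_b / total_b


def max_completion_tokens(prompt_tokens: int) -> int:
    """Hypothetical helper: largest output that still fits next to the prompt.

    Assumes prompt and output share the same context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The output is capped both by the output limit and by the space left.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    print(f"active fraction: {active_fraction():.1%}")  # 5.4%
    print(max_completion_tokens(50_000))                # 128000
    print(max_completion_tokens(150_000))               # 50000
```

Under these assumptions, roughly one parameter in nineteen is activated, and the full 128K output limit is only reachable while the prompt stays under 72,000 tokens.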
Benchmark results
| Benchmark | Score | Tags |
|---|---|---|
| BrowseComp | 75.9% | self-reported llm-stats |
| MCP Atlas | 67.8% | self-reported llm-stats |
| SWE-Bench Verified | 77.8% | self-reported llm-stats |
| τ²-Bench | 89.7% | self-reported llm-stats |
| Terminal-Bench 2.0 | 56.2% | self-reported llm-stats |