GLM-4.5-Air

GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It features 106 billion total parameters with 12 billion active parameters using MoE architecture. Like GLM-4.5, it is a hybrid reasoning model providing thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. Despite its compact design, GLM-4.5-Air delivers competitive performance with a score of 59.8 across 12 industry-standard benchmarks, ranking 6th overall while maintaining superior efficiency. It supports 128K context length and is released under MIT open-source license allowing commercial use.

Benchmark results

Benchmark	Score	Tags	Source
AA-Index	64.8%	self-reported llm-stats	link →
AIME 2024	89.4%	self-reported llm-stats	link →
BFCL-v3	76.4%	self-reported llm-stats	link →
BrowseComp	21.3%	self-reported llm-stats	link →
GPQA	75.0%	self-reported llm-stats	link →
HLE	10.6%	self-reported llm-stats	link →
Humanity's Last Exam	10.6%	self-reported llm-stats	link →
LiveCodeBench	70.7%	self-reported llm-stats	link →
MATH-500	98.1%	self-reported llm-stats	link →
MMLU-Pro	81.4%	self-reported llm-stats	link →
SciCode	37.3%	self-reported llm-stats	link →
SWE-Bench Verified	57.6%	self-reported llm-stats	link →
TAU-bench Airline	60.8%	self-reported llm-stats	link →
TAU-bench Retail	77.9%	self-reported llm-stats	link →
Terminal-Bench	30.0%	self-reported llm-stats	link →