GLM-4.5-Air

Zero-eval
#2MATH-500
#2BFCL-v3
#2AA-Index
+3 more

by Zhipu AI

+
+
+
+
About

GLM-4.5-Air is a language model developed by Zhipu AI. It achieves strong performance with an average score of 60.8% across 14 benchmarks. It excels particularly in MATH-500 (98.1%), AIME 2024 (89.4%), MMLU-Pro (81.4%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Zhipu AI's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedJul 28, 2025
ReleasedJul 28, 2025
+
+
+
+
License & Family
License
MIT
Performance Overview
Performance metrics and category breakdown

Overall Performance

14 benchmarks
Average Score
60.8%
Best Score
98.1%
High Performers (80%+)
3
+
+
+
+
All Benchmark Results for GLM-4.5-Air
Complete list of benchmark scores with detailed information
MATH-500
text
0.98
98.1%
Self-reported
AIME 2024
text
0.89
89.4%
Self-reported
MMLU-Pro
text
0.81
81.4%
Self-reported
TAU-bench Retail
text
0.78
77.9%
Self-reported
BFCL-v3
text
0.76
76.4%
Self-reported
GPQA
text
0.75
75.0%
Self-reported
LiveCodeBench
text
0.71
70.7%
Self-reported
AA-Index
text
0.65
64.8%
Self-reported
TAU-bench Airline
text
0.61
60.8%
Self-reported
SWE-Bench Verified
text
0.58
57.6%
Self-reported
Showing 1 to 10 of 14 benchmarks