Zhipu AI

About

Zhipu AI is a Chinese AI company that provides a suite of AI tools and services.

Portfolio Stats
Total Models: 4
Multimodal: 2
Benchmarks Run: 35
Avg Performance: 46.5%
Latest Release
GLM-4.6
Released: Sep 30, 2025
Multimodal
Release Timeline
Recent model releases by year
2025: 4 models
Performance Overview
Top models and benchmark performance

Top Performing Models

By avg score: 64.0%, 61.2%, 0.0%

Benchmark Categories

Other
35
62.2%

Model Statistics

Multimodal Ratio: 50%
Models with Providers: 3

All Models

Complete portfolio of 4 models

License Links
#01 GLM-4.6
GLM-4.6 is the latest version of Z.ai's flagship model and improves on GLM-4.5 in several areas. Key features include a 200K-token context window (expanded from 128K), stronger coding performance with better real-world results in Claude Code, Cline, Roo Code, and Kilo Code, reasoning with tool use during inference, stronger agent capabilities, and writing that is better aligned with human preferences. GLM-4.6 performs competitively with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.
Sep 30, 2025
MIT
68.0% - - - -
#02 GLM-4.5
GLM-4.5 is an Agentic, Reasoning, and Coding (ARC) foundation model designed for intelligent agents. It uses a Mixture-of-Experts (MoE) architecture with 355 billion total parameters, of which 32 billion are active. Trained on 23T tokens through multi-stage training, it is a hybrid reasoning model with two modes: a thinking mode for complex reasoning and tool usage, and a non-thinking mode for immediate responses. The model unifies agentic, reasoning, and coding capabilities and supports a 128K context length. It scores 63.2 across 12 industry-standard benchmarks, placing 3rd among all proprietary and open-source models. It is released under the MIT open-source license, which allows commercial use and secondary development.
Jul 28, 2025
MIT
64.2% - - 72.9% -
#03 GLM-4.5-Air
GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It uses a Mixture-of-Experts (MoE) architecture with 106 billion total parameters, of which 12 billion are active. Like GLM-4.5, it is a hybrid reasoning model with a thinking mode for complex reasoning and tool usage and a non-thinking mode for immediate responses. Despite its smaller size, GLM-4.5-Air scores 59.8 across 12 industry-standard benchmarks, ranking 6th overall. It supports a 128K context length and is released under the MIT open-source license, which allows commercial use.
Jul 28, 2025
MIT
57.6% - - 70.7% -
#04 GLM-4.5V
GLM-4.5V is a multimodal (vision-language) model based on GLM-4.5-Air (106B total parameters, 12B active) that extends hybrid reasoning to images and video. It achieves state-of-the-art results across 40+ VLM benchmarks covering image reasoning, video understanding, GUI tasks, chart and document parsing, and grounding, and it supports a Thinking Mode switch for deep reasoning. It is released under the MIT license with FP8 and BF16 variants and is supported in Transformers, vLLM, and SGLang.
Aug 11, 2025
MIT
- - - - -
Resources