Zhipu AI

z.aiCN

About

Zhipu AI is a Chinese AI company that provides a suite of AI tools and services.

Portfolio Stats

Total Models4

Multimodal2

Benchmarks Run35

Avg Performance46.5%

Latest Release

GLM-4.6

Released: Sep 30, 2025

Multimodal

Release Timeline

Recent model releases by year

2025

4 models

Performance Overview

Top models and benchmark performance

Top Performing Models

By avg score

#1GLM-4.5

64.0%

#2GLM-4.6

61.2%

#3GLM-4.5-Air

60.8%

#4GLM-4.5V

0.0%

Benchmark Categories

Other

62.2%

Model Statistics

Multimodal Ratio

50%

Models with Providers

All Models

Complete portfolio of 4 models with advanced filtering

		License
#01GLM-4.6 GLM-4.6 is the latest version of Z.ai's flagship model, bringing significant improvements over GLM-4.5. Key features include: 200K token context window (expanded from 128K), superior coding performance with better real-world application in Claude Code/Cline/Roo Code/Kilo Code, advanced reasoning with tool use during inference, stronger agent capabilities, and refined writing aligned with human preferences. GLM-4.6 achieves competitive performance with DeepSeek-V3.2-Exp and Claude Sonnet 4, reaching near parity with Claude Sonnet 4 (48.6% win rate) on CC-Bench real-world coding tasks.	Sep 30, 2025	MIT	68.0%	-	-	-	-
#02GLM-4.5 GLM-4.5 is an Agentic, Reasoning, and Coding (ARC) foundation model designed for intelligent agents, featuring 355 billion total parameters with 32 billion active parameters using MoE architecture. Trained on 23T tokens through multi-stage training, it is a hybrid reasoning model that provides two modes: thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. The model unifies agentic, reasoning, and coding capabilities with 128K context length support. It achieves exceptional performance with a score of 63.2 across 12 industry-standard benchmarks, placing 3rd among all proprietary and open-source models. Released under MIT open-source license allowing commercial use and secondary development.	Jul 28, 2025	MIT	64.2%	-	-	72.9%	-
#03GLM-4.5-Air GLM-4.5-Air is a more compact variant of GLM-4.5 designed for efficient Agentic, Reasoning, and Coding (ARC) applications. It features 106 billion total parameters with 12 billion active parameters using MoE architecture. Like GLM-4.5, it is a hybrid reasoning model providing thinking mode for complex reasoning and tool usage, and non-thinking mode for immediate responses. Despite its compact design, GLM-4.5-Air delivers competitive performance with a score of 59.8 across 12 industry-standard benchmarks, ranking 6th overall while maintaining superior efficiency. It supports 128K context length and is released under MIT open-source license allowing commercial use.	Jul 28, 2025	MIT	57.6%	-	-	70.7%	-
#04GLM-4.5V GLM-4.5V is a multimodal (vision-language) model based on GLM-4.5-Air (106B total, 12B active) that extends hybrid reasoning to images and video. It achieves state-of-the-art results across 40+ VLM benchmarks (image reasoning, video understanding, GUI tasks, chart/document parsing, grounding) while supporting a Thinking Mode switch for deep reasoning. Released under MIT with FP8/BF16 variants and tooling in Transformers, vLLM, and SGLang.	Aug 11, 2025	MIT	-	-	-	-	-

Resources

Official Website