Moonshot AI

Kimi K2 0905

Zero-eval
#1HumanEval

by Moonshot AI

+
+
+
+
About

Kimi K2 0905 is a language model developed by Moonshot AI. This model demonstrates exceptional performance with an average score of 84.0% across 6 benchmarks. It excels particularly in HumanEval (94.5%), MMLU (90.2%), MATH (89.1%). It supports a 524K token context window for handling large documents. The model is available through 2 API providers. Released in 2025, it represents Moonshot AI's latest advancement in AI technology.

+
+
+
+
Pricing Range
Input (per 1M)$0.60 -$0.60
Output (per 1M)$2.50 -$2.50
Providers2
+
+
+
+
Timeline
AnnouncedSep 5, 2025
ReleasedSep 5, 2025
+
+
+
+
License & Family
License
Proprietary
Base ModelKimi K2 Instruct
Performance Overview
Performance metrics and category breakdown

Overall Performance

6 benchmarks
Average Score
84.0%
Best Score
94.5%
High Performers (80%+)
4

Performance Metrics

Max Context Window
524.3K
+
+
+
+
All Benchmark Results for Kimi K2 0905
Complete list of benchmark scores with detailed information
HumanEval
text
0.94
94.5%
Self-reported
MMLU
text
0.90
90.2%
Self-reported
MATH
text
0.89
89.1%
Self-reported
MMLU-Pro
text
0.82
82.5%
Self-reported
GPQA
text
0.76
75.8%
Self-reported
AIME 2024
text
0.72
72.0%
Self-reported