Kimi K2 0905
Zero-eval
#1HumanEval
by Moonshot AI
+
+
+
+
About
Kimi K2 was introduced as the second generation of Moonshot's language model family, designed to provide enhanced capabilities across language understanding and generation. Built with architectural improvements and expanded training, it represents a significant advancement in Moonshot's model offerings.
+
+
+
+
Pricing Range
Input (per 1M)$0.60 -$0.60
Output (per 1M)$2.50 -$2.50
Providers2
+
+
+
+
Timeline
AnnouncedSep 5, 2025
ReleasedSep 5, 2025
+
+
+
+
License & Family
License
Proprietary
Base ModelKimi K2 Instruct
Performance Overview
Performance metrics and category breakdown
Overall Performance
6 benchmarks
Average Score
84.0%
Best Score
94.5%
High Performers (80%+)
4Performance Metrics
Max Context Window
524.3K+
+
+
+
All Benchmark Results for Kimi K2 0905
Complete list of benchmark scores with detailed information
| HumanEval | text | 0.94 | 94.5% | Self-reported | |
| MMLU | text | 0.90 | 90.2% | Self-reported | |
| MATH | text | 0.89 | 89.1% | Self-reported | |
| MMLU-Pro | text | 0.82 | 82.5% | Self-reported | |
| GPQA | text | 0.76 | 75.8% | Self-reported | |
| AIME 2024 | text | 0.72 | 72.0% | Self-reported |
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+