Moonshot AI

Kimi K2 Base

Zero-eval
#1C-Eval
#1MMLU-redux-2.0
#1TriviaQA
+2 more

by Moonshot AI

+
+
+
+
About

Kimi K2 Base is a language model developed by Moonshot AI. It achieves strong performance with an average score of 69.2% across 13 benchmarks. It excels particularly in C-Eval (92.5%), GSM8k (92.1%), MMLU-redux-2.0 (90.2%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Moonshot AI's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedJul 11, 2025
ReleasedJul 11, 2025
+
+
+
+
Specifications
Training Tokens15.5T
+
+
+
+
License & Family
License
MIT
Performance Overview
Performance metrics and category breakdown

Overall Performance

13 benchmarks
Average Score
69.2%
Best Score
92.5%
High Performers (80%+)
6
+
+
+
+
All Benchmark Results for Kimi K2 Base
Complete list of benchmark scores with detailed information
C-Eval
text
0.93
92.5%
Self-reported
GSM8k
text
0.92
92.1%
Self-reported
MMLU-redux-2.0
text
0.90
90.2%
Self-reported
MMLU
text
0.88
87.8%
Self-reported
TriviaQA
text
0.85
85.1%
Self-reported
EvalPlus
text
0.80
80.3%
Self-reported
CSimpleQA
text
0.78
77.6%
Self-reported
MATH
text
0.70
70.2%
Self-reported
MMLU-Pro
text
0.69
69.2%
Self-reported
GPQA
text
0.48
48.1%
Self-reported
Showing 1 to 10 of 13 benchmarks