Comprehensive side-by-side LLM comparison
DeepSeek-R1-0528 leads with 9.0% higher average benchmark score. Kimi K2 0905 offers 262.1K more tokens in context window than DeepSeek-R1-0528. Both models have similar pricing. Overall, DeepSeek-R1-0528 is the stronger choice for coding tasks.
DeepSeek
DeepSeek-R1-0528 represents a specific release iteration of the DeepSeek-R1 model, developed to incorporate refinements and improvements from ongoing training. Built to provide enhanced reasoning capabilities based on accumulated insights, it continues the evolution of DeepSeek's reasoning-focused architecture.
Moonshot AI
Kimi K2 was introduced as the second generation of Moonshot's language model family, designed to provide enhanced capabilities across language understanding and generation. Built with architectural improvements and expanded training, it represents a significant advancement in Moonshot's model offerings.
3 months newer

DeepSeek-R1-0528
DeepSeek
2025-05-28

Kimi K2 0905
Moonshot AI
2025-09-05
Cost per million tokens (USD)

DeepSeek-R1-0528

Kimi K2 0905
Context window and performance specifications
Average performance across 3 common benchmarks

DeepSeek-R1-0528

Kimi K2 0905
Available providers and their performance metrics

DeepSeek-R1-0528
DeepInfra
DeepSeek
Novita

DeepSeek-R1-0528

Kimi K2 0905

DeepSeek-R1-0528

Kimi K2 0905

Kimi K2 0905
Novita
ZeroEval