Comprehensive side-by-side LLM comparison
Kimi K2 0905 leads with 1.8% higher average benchmark score. Kimi K2 0905 offers 196.6K more tokens in context window than DeepSeek-V3.1. DeepSeek-V3.1 is $1.83 cheaper per million tokens. Both models have their strengths depending on your specific coding needs.
DeepSeek
DeepSeek-V3.1 was developed as an incremental advancement over DeepSeek-V3, designed to refine the mixture-of-experts architecture with improved training techniques. Built to enhance quality and efficiency while maintaining the open-source philosophy, it represents continued iteration on DeepSeek's flagship model line.
Moonshot AI
Kimi K2 was introduced as the second generation of Moonshot's language model family, designed to provide enhanced capabilities across language understanding and generation. Built with architectural improvements and expanded training, it represents a significant advancement in Moonshot's model offerings.
7 months newer

DeepSeek-V3.1
DeepSeek
2025-01-10

Kimi K2 0905
Moonshot AI
2025-09-05
Cost per million tokens (USD)

DeepSeek-V3.1

Kimi K2 0905
Context window and performance specifications
Average performance across 3 common benchmarks

DeepSeek-V3.1

Kimi K2 0905
Available providers and their performance metrics

DeepSeek-V3.1
DeepInfra
Novita

Kimi K2 0905

DeepSeek-V3.1

Kimi K2 0905

DeepSeek-V3.1

Kimi K2 0905
Novita
ZeroEval