Comprehensive side-by-side LLM comparison
DeepSeek-R1-0528 leads Kimi K2-Instruct-0905 with a 14.8% higher average benchmark score and is available on 3 providers. Overall, DeepSeek-R1-0528 is the stronger choice for coding tasks.
DeepSeek
DeepSeek-R1-0528 is a specific release of the DeepSeek-R1 model that incorporates refinements from ongoing training. It aims to improve reasoning capabilities and continues the evolution of DeepSeek's reasoning-focused architecture.
Moonshot AI
Kimi K2-Instruct-0905 is a specific release of the K2 Instruct model that incorporates refinements based on deployment feedback. It aims to improve instruction following and continues the evolution of Moonshot AI's instruction-tuned offerings.
Kimi K2-Instruct-0905 is about 3 months newer than DeepSeek-R1-0528.

DeepSeek-R1-0528 (DeepSeek): released 2025-05-28
Kimi K2-Instruct-0905 (Moonshot AI): released 2025-09-05
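
The release-date gap can be checked with a few lines of Python. This is a minimal sketch using only the two dates listed here. The helper name `whole_months_between` and the `RELEASES` mapping are illustrative assumptions, not part of any comparison tool.

```python
from datetime import date

# Release dates as listed in this comparison.
RELEASES = {
    "DeepSeek-R1-0528": date(2025, 5, 28),
    "Kimi K2-Instruct-0905": date(2025, 9, 5),
}


def whole_months_between(earlier: date, later: date) -> int:
    """Count complete calendar months from `earlier` to `later` (hypothetical helper)."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    # If the day-of-month has not been reached yet, the last month is incomplete.
    if later.day < earlier.day:
        months -= 1
    return months


# Sort model names by release date: oldest first, newest last.
older, newer = sorted(RELEASES, key=RELEASES.get)
gap_days = (RELEASES[newer] - RELEASES[older]).days
gap_months = whole_months_between(RELEASES[older], RELEASES[newer])

print(f"{newer} is {gap_days} days ({gap_months} whole months) newer than {older}")
```

With these dates the gap is 100 days, or 3 whole months plus 8 days, which matches the "3 months newer" label.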
Context window and performance specifications
[Chart: average performance across 11 common benchmarks, DeepSeek-R1-0528 vs. Kimi K2-Instruct-0905]
[Chart: performance comparison across key benchmark categories, DeepSeek-R1-0528 vs. Kimi K2-Instruct-0905]
Available providers and their performance metrics

DeepSeek-R1-0528: DeepInfra, DeepSeek, Novita
