Comprehensive side-by-side LLM comparison
Mistral Small 3.1 24B Base leads with a 5.8% higher average score across the 2 benchmarks the models have in common. Both models have similar pricing. Mistral Small 3.1 24B Base supports multimodal inputs. Qwen2.5-Coder 32B Instruct is available from 4 providers. Overall, on these shared benchmarks, Mistral Small 3.1 24B Base is rated the stronger choice for coding tasks.
Mistral AI
Mistral Small 3.1 24B Base is an updated version of the 24B foundation model, with architectural refinements and improved training. It is intended as a stronger base for fine-tuning and incorporates lessons from earlier versions to improve downstream performance.
Alibaba Cloud / Qwen Team
Qwen2.5-Coder 32B Instruct is a specialized coding model with 32 billion parameters, optimized for programming tasks. It understands and generates code across multiple programming languages and is aimed at developers who need advanced code completion, debugging, and code explanation.
Release dates (Mistral Small 3.1 24B Base is 5 months newer)

Qwen2.5-Coder 32B Instruct
Alibaba Cloud / Qwen Team
2024-09-19

Mistral Small 3.1 24B Base
Mistral AI
2025-03-17
Cost per million tokens (USD)

[Chart: Mistral Small 3.1 24B Base vs. Qwen2.5-Coder 32B Instruct]
Context window and performance specifications
Average performance across 2 common benchmarks

[Chart: Mistral Small 3.1 24B Base vs. Qwen2.5-Coder 32B Instruct]
Available providers and their performance metrics

Mistral Small 3.1 24B Base: Mistral AI
Qwen2.5-Coder 32B Instruct: DeepInfra, Fireworks, Hyperbolic, Lambda