Devstral Medium vs GPT-4.1 mini: Complete Benchmarks, Speed & Cost Comparison (2025)

Devstral Medium vs GPT-4.1 mini

Comprehensive side-by-side LLM comparison

Devstral Medium leads with 38.0% higher average benchmark score. GPT-4.1 mini offers 824.3K more tokens in context window than Devstral Medium. Both models have similar pricing. GPT-4.1 mini supports multimodal inputs. Overall, Devstral Medium is the stronger choice for coding tasks.

Mistral AI

Devstral Medium was created as a development-focused model, designed to assist with software engineering workflows and developer-centric tasks. Built to provide balanced capability for coding, debugging, and technical documentation, it serves as a versatile tool for professional development environments.

OpenAI

GPT-4.1 Mini was created as a smaller, more efficient variant of GPT-4.1, designed to provide strong capabilities with reduced computational requirements. Built to serve applications where speed and cost are priorities while maintaining solid performance, it extends the GPT-4.1 capabilities to resource-conscious deployments.

2 months newer

GPT-4.1 mini

OpenAI

2025-04-14

Devstral Medium

Mistral AI

2025-07-10

Pricing Comparison

Cost per million tokens (USD)

Devstral Medium

Input:$0.40

Output:$2.00

GPT-4.1 mini

Input:$0.40

Output:$1.60($0.40 cheaper)

Performance Metrics

Context window and performance specifications

Average performance across 1 common benchmarks

Devstral Medium

Average Score:61.6%(+38.0%)

GPT-4.1 mini

Average Score:23.6%

Performance comparison across key benchmark categories

Devstral Medium

Coding61.6%(+38.0%)

GPT-4.1 mini

Coding23.6%

Knowledge Cutoff

Training data recency comparison

GPT-4.1 mini

2024-05-31

More recent knowledge cutoff means awareness of newer technologies and frameworks

Provider Availability & Performance

Available providers and their performance metrics

Devstral Medium

1 providers

Mistral AI

Throughput: 137.1 tok/s

Latency: 0.23ms

GPT-4.1 mini

Devstral Medium

Avg Score:61.6%(+38.0%)

Providers:1

GPT-4.1 mini

Avg Score:23.6%

Providers:2