
Devstral-2-123B vs o3 Pro

Comprehensive side-by-side LLM comparison

o3 Pro lists a maximum context 35.8K tokens larger than Devstral-2-123B (300.0K vs 264.2K). Devstral-2-123B is $96.00 cheaper per million tokens when input and output prices are combined ($4.00 vs $100.00). o3 Pro supports multimodal (vision) inputs. Which model fits better depends on your specific coding needs.


Mistral AI

Devstral 2, released by Mistral AI on December 9, 2025, is a 123 billion parameter dense transformer model specifically designed for software engineering tasks. It features a 256K token context window and achieved 72.2% on SWE-bench Verified at release, making it a competitive open-weight option for automated coding and agentic development. Devstral 2 targets code generation, multi-file software engineering, and agentic development workflows under a modified MIT license.


OpenAI

OpenAI o3-pro, released by OpenAI in June 2025, is an extended reasoning model from the o3 family that applies more compute per response to deliver deeper, more thorough answers on complex problems. It features a 200K token context window, 100K maximum output tokens, and vision capabilities. o3-pro targets research-grade reasoning tasks, extended coding sessions, and applications where accuracy on difficult problems justifies higher inference cost and latency.

Devstral-2-123B is 6 months newer

o3 Pro

OpenAI

2025-06-10

Devstral-2-123B

Mistral AI

2025-12-09

Pricing Comparison

Cost per million tokens (USD)


Devstral-2-123B

Input: $1.00

Output: $3.00 ($96.00 cheaper, input and output combined)


o3 Pro

Input: $20.00

Output: $80.00
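
The "$96.00 cheaper" figure can be reproduced from the listed prices by adding each model's input and output rates. The sketch below is illustrative only: the helper names `combined_rate` and `request_cost` are hypothetical, and the 50,000-input / 10,000-output workload is an assumed example, not a measured one.

```python
# Prices in USD per million tokens, copied from the pricing listing.
PRICES = {
    "Devstral-2-123B": {"input": 1.00, "output": 3.00},
    "o3 Pro": {"input": 20.00, "output": 80.00},
}


def combined_rate(model: str) -> float:
    """Input price plus output price, per million tokens each."""
    price = PRICES[model]
    return price["input"] + price["output"]


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed rates."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Gap between the combined rates: 100.00 - 4.00
gap = combined_rate("o3 Pro") - combined_rate("Devstral-2-123B")
print(f"Combined-rate gap: ${gap:.2f}")  # Combined-rate gap: $96.00

# Hypothetical workload: 50,000 input tokens and 10,000 output tokens.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 50_000, 10_000):.2f}")
```

Note that the gap depends on the mix of input and output tokens; a workload dominated by output tokens is affected mostly by the $77.00 difference in output price.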

Performance Metrics

Context window and performance specifications

Provider Availability & Performance

Available providers and their performance metrics


Devstral-2-123B

1 provider

OpenRouter


o3 Pro

1 provider


Devstral-2-123B

Avg Score: 0.0%

Providers: 1


o3 Pro

Avg Score: 0.0%

Providers: 1


Devstral-2-123B

Max Context: 264.2K

Parameters: 123.0B


o3 Pro

Max Context: 300.0K (Larger context)

OpenAI
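
The 35.8K context gap follows directly from the two "Max Context" values listed here. The sketch below uses those listed figures as-is, treating "K" as 1,000 tokens, which is an assumption; the model descriptions earlier on this page cite 256K and 200K context windows, so the exact limits should be checked with the provider. The names `context_gap` and `fits` are hypothetical helpers, and counting reserved output tokens against the same window is also an assumption that may not hold for every provider.

```python
# "Max Context" values as listed on this page, with K read as 1,000 tokens.
MAX_CONTEXT_TOKENS = {
    "Devstral-2-123B": 264_200,
    "o3 Pro": 300_000,
}


def context_gap(larger: str, smaller: str) -> int:
    """Difference in listed maximum context, in tokens."""
    return MAX_CONTEXT_TOKENS[larger] - MAX_CONTEXT_TOKENS[smaller]


def fits(model: str, prompt_tokens: int, reserved_output_tokens: int = 0) -> bool:
    """True if the prompt plus reserved output stays within the listed limit."""
    needed = prompt_tokens + reserved_output_tokens
    return needed <= MAX_CONTEXT_TOKENS[model]


print(context_gap("o3 Pro", "Devstral-2-123B"))  # 35800

# Hypothetical 280,000-token prompt: within one listed limit, over the other.
for model in MAX_CONTEXT_TOKENS:
    print(model, fits(model, 280_000))
```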