Comprehensive side-by-side LLM comparison
o3 Pro offers 35.8K more tokens of context than Devstral-2-123B, while Devstral-2-123B is $96.00 cheaper per million tokens. o3 Pro also supports multimodal inputs. Devstral-2-123B suits software engineering and agentic coding workflows; o3 Pro suits reasoning-heavy problems where accuracy justifies higher cost and latency.
Mistral AI
Devstral 2, released by Mistral AI on December 9, 2025, is a 123 billion parameter dense transformer model specifically designed for software engineering tasks. It features a 256K token context window and achieved 72.2% on SWE-bench Verified at release, making it a competitive open-weight option for automated coding and agentic development. Devstral 2 targets code generation, multi-file software engineering, and agentic development workflows under a modified MIT license.
OpenAI
OpenAI o3-pro, released by OpenAI in June 2025, is an extended reasoning model from the o3 family that applies more compute per response to deliver deeper, more thorough answers on complex problems. It features a 200K token context window, 100K maximum output tokens, and vision capabilities. o3-pro targets research-grade reasoning tasks, extended coding sessions, and applications where accuracy on difficult problems justifies higher inference cost and latency.
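The limits quoted in the two descriptions can be turned into a quick request-size check. The following is a minimal sketch, not either vendor's API: the `ModelLimits` class and `fits` helper are hypothetical names, "K" is read as 1,000 tokens, and the context window is assumed to be shared by prompt and completion. No maximum output figure is given above for Devstral 2, so it is left unset.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ModelLimits:
    """Hypothetical container for the limits quoted in the descriptions."""
    name: str
    context_window: int        # total tokens, assumed to cover prompt + completion
    max_output: Optional[int]  # None when no figure is given in the text


# Figures taken from the descriptions; "K" read as 1,000 tokens (assumption).
DEVSTRAL_2 = ModelLimits("Devstral-2-123B", 256_000, None)
O3_PRO = ModelLimits("o3 Pro", 200_000, 100_000)


def fits(model: ModelLimits, prompt_tokens: int, output_tokens: int) -> bool:
    """Return True if the request stays within the model's stated limits."""
    if model.max_output is not None and output_tokens > model.max_output:
        return False
    return prompt_tokens + output_tokens <= model.context_window


# A 180,000-token prompt with a 30,000-token completion budget.
for model in (DEVSTRAL_2, O3_PRO):
    print(model.name, fits(model, 180_000, 30_000))
```

Token counts depend on each model's tokenizer, so the same text may produce different `prompt_tokens` values for the two models.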
Release dates
o3 Pro (OpenAI): 2025-06-10
Devstral-2-123B (Mistral AI): 2025-12-09 (6 months newer)
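The "6 months newer" label follows from the two release dates listed above. A minimal sketch of that arithmetic, assuming the gap is rounded using an average month length (the constant and helper name are illustrative, not taken from the page):

```python
from datetime import date

# Release dates as listed above.
O3_PRO_RELEASE = date(2025, 6, 10)
DEVSTRAL_2_RELEASE = date(2025, 12, 9)

# Assumption: average Gregorian month length, used only for rounding.
AVG_DAYS_PER_MONTH = 365.25 / 12


def approx_months_between(earlier: date, later: date) -> int:
    """Round the gap between two dates to the nearest whole month."""
    return round((later - earlier).days / AVG_DAYS_PER_MONTH)


gap_days = (DEVSTRAL_2_RELEASE - O3_PRO_RELEASE).days
gap_months = approx_months_between(O3_PRO_RELEASE, DEVSTRAL_2_RELEASE)
print(f"{gap_days} days, about {gap_months} months")
```

Counted strictly in complete calendar months the gap is one day short of six, so the label is a rounded figure.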
[Chart: cost per million tokens (USD), Devstral-2-123B vs. o3 Pro]
Context window and performance specifications
Available providers and their performance metrics
Devstral-2-123B: OpenRouter
o3 Pro: OpenAI