Comprehensive side-by-side LLM comparison
Devstral-2-123B's context window is 192 tokens larger than that of Claude Sonnet 4.6, and it is $14.00 cheaper per million tokens. Claude Sonnet 4.6 supports multimodal inputs and is available from 3 providers. Each model has its strengths, depending on your specific coding needs.
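To see what the quoted price gap means for a real workload, the arithmetic can be sketched as below. This is a rough illustration only: it assumes the $14.00 figure is a single blended per-million-token difference that applies uniformly to all tokens, which the summary does not confirm. The function name `estimated_savings_usd` and the sample token volumes are hypothetical.

```python
# Assumption: the $14.00 gap quoted above is one blended rate per million
# tokens; actual billing usually prices input and output tokens separately.
PRICE_GAP_USD_PER_MILLION = 14.00


def estimated_savings_usd(total_tokens: int) -> float:
    """Estimate the saving from running total_tokens on the cheaper model."""
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")
    return total_tokens / 1_000_000 * PRICE_GAP_USD_PER_MILLION


if __name__ == "__main__":
    # Hypothetical monthly token volumes, chosen only for illustration.
    for tokens in (1_000_000, 50_000_000, 250_000_000):
        saving = estimated_savings_usd(tokens)
        print(f"{tokens:>12,} tokens -> ${saving:,.2f} saved")
```

At low volumes the difference is small, so factors such as multimodal input support or provider availability may weigh more heavily than price.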
Anthropic
Claude Sonnet 4.6 is a general-purpose language model from Anthropic, released in February 2026 as an update to the Sonnet 4 line. It introduced adaptive thinking, a mode in which the model automatically calibrates its reasoning depth to task complexity instead of requiring manual configuration by the developer. The model accepts text and image inputs and integrates natively with web search and code execution tools, bringing capabilities that previously required separate toolchain setup into a single API surface. It became the primary workhorse model in the Claude 4 series for code assistance, agentic pipelines, and retrieval-augmented applications that benefit from built-in web access.
Mistral AI
Devstral 2, released by Mistral AI on December 9, 2025, is a 123-billion-parameter dense transformer model designed specifically for software engineering tasks. It features a 256K token context window and achieved 72.2% on SWE-bench Verified at release, making it a competitive open-weight option for automated coding and agentic development. Devstral 2 targets code generation, multi-file software engineering, and agentic development workflows, and is released under a modified MIT license.
Release dates (Claude Sonnet 4.6 is 2 months newer)
Devstral-2-123B (Mistral AI): 2025-12-09
Claude Sonnet 4.6 (Anthropic): 2026-02-17
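The "2 months newer" label can be checked against the two release dates listed above. The sketch below uses only Python's standard `datetime` module. The constant and function names are illustrative and do not come from the comparison page.

```python
from datetime import date

# Release dates as listed in the comparison above.
DEVSTRAL_2_RELEASE = date(2025, 12, 9)
CLAUDE_SONNET_4_6_RELEASE = date(2026, 2, 17)


def gap_in_days(older: date, newer: date) -> int:
    """Number of calendar days from older to newer."""
    return (newer - older).days


def gap_in_whole_months(older: date, newer: date) -> int:
    """Number of complete calendar months from older to newer."""
    months = (newer.year - older.year) * 12 + (newer.month - older.month)
    # A month only counts once the day-of-month has been reached.
    if newer.day < older.day:
        months -= 1
    return months


if __name__ == "__main__":
    days = gap_in_days(DEVSTRAL_2_RELEASE, CLAUDE_SONNET_4_6_RELEASE)
    months = gap_in_whole_months(DEVSTRAL_2_RELEASE, CLAUDE_SONNET_4_6_RELEASE)
    print(f"{days} days, {months} whole months")  # 70 days, 2 whole months
```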
[Chart: Cost per million tokens (USD) for Claude Sonnet 4.6 and Devstral-2-123B]
Context window and performance specifications
Claude Sonnet 4.6
2025-08
Available providers and their performance metrics
Claude Sonnet 4.6: Anthropic, AWS Bedrock, Google Cloud Vertex AI
Devstral-2-123B: OpenRouter