Claude Opus 4.1
Multimodal
#3GDPVal
by Anthropic
+
+
+
+
About
Claude Opus 4.1, released by Anthropic in August 2025, is built for multi-step agentic tasks requiring sustained reasoning across long sessions — designed to orchestrate complex sequences of actions, delegate to sub-agents, and maintain coherent goals over extended autonomous work. It targets applications where a model must reliably execute as an orchestrator rather than just a responder.
+
+
+
+
Pricing Range
Input (per 1M)$15.00 -$15.00
Output (per 1M)$75.00 -$75.00
Providers3
+
+
+
+
Timeline
ReleasedAug 5, 2025
Knowledge CutoffJan 1, 2025
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Proprietary
Performance Overview
Performance metrics and category breakdown
Overall Performance
6 benchmarks
Average Score
28.4%
Best Score
74.5%
High Performers (80%+)
0Performance Metrics
Max Context Window
232.0KTop Categories
Coding
74.5%
Tool Use
40.9%
Agents
22.3%
Reasoning
11.5%
Multimodal
-1.0%
+
+
+
+
All Benchmark Results for Claude Opus 4.1
Complete list of benchmark scores with detailed information
| SWE Bench Verified | Coding | 74.50 | 74.5% | Unverified | |
| GDPVal | Agents | 43.60 | 43.6% | Unverified | |
| MCP-Atlas | Tool Use | 40.90 | 40.9% | Unverified | |
| Humanity's Last Exam | Reasoning | 11.52 | 11.5% | Unverified | |
| BrowseComp | Agents | 1.00 | 1.0% | Unverified | |
| MMMU | Multimodal | -1.00 | -1.0% | Unverified |