Comprehensive side-by-side LLM comparison
o3-mini leads with 11.3% higher average benchmark score. o3-mini is available on 2 providers. Overall, o3-mini is the stronger choice for coding tasks.
OpenAI
o3-mini was created as an efficient variant of the o3 reasoning model, designed to provide advanced thinking capabilities with reduced computational requirements. Built to make next-generation reasoning accessible to a broader range of applications, it balances analytical depth with practical speed and cost considerations.
Microsoft
Phi-4 Reasoning was developed to incorporate extended analytical thinking into the Phi-4 architecture, designed to spend more time on complex problem-solving. Built to combine compact model efficiency with reasoning depth, it represents Microsoft's exploration of thoughtful small models.
3 months newer

o3-mini
OpenAI
2025-01-30

Phi 4 Reasoning
Microsoft
2025-04-30
Context window and performance specifications
Average performance across 3 common benchmarks

o3-mini

Phi 4 Reasoning
o3-mini
2023-09-30
Phi 4 Reasoning
2025-03-01
Available providers and their performance metrics

o3-mini
Azure
OpenAI


o3-mini

Phi 4 Reasoning

o3-mini

Phi 4 Reasoning
Phi 4 Reasoning