Comprehensive side-by-side LLM comparison
Phi 4 leads with a 13.9% higher average benchmark score and is $1.29 cheaper per million tokens. Claude 3 Haiku offers a context window 368.0K tokens larger than Phi 4's, supports multimodal inputs, and is available on 3 providers. Overall, Phi 4 is the stronger choice for coding tasks.
Anthropic
Claude 3 Haiku was created as the fastest and most affordable model in its intelligence class, processing 21K tokens per second for prompts under 32K tokens. Built for enterprise workloads requiring quick analysis of large datasets, it combines state-of-the-art vision capabilities with strong performance on industry benchmarks while prioritizing speed, affordability, and enterprise-grade security.
Microsoft
Phi-4 was introduced as the fourth generation of Microsoft's small language model series, designed to push the boundaries of what compact models can achieve. Built with advanced training techniques and architectural improvements, it demonstrates continued progress in efficient, high-quality language models.
Release dates (Phi 4 is 9 months newer)

Claude 3 Haiku (Anthropic): 2024-03-13
Phi 4 (Microsoft): 2024-12-12
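
The "9 months newer" figure can be reproduced from the two release dates listed here. The following is a minimal Python sketch, assuming the gap is rounded to the nearest month using an average month length; the page does not state how it rounds, and `months_between` is a hypothetical helper introduced only for illustration.

```python
from datetime import date

# Release dates as listed on this page.
RELEASES = {
    "Claude 3 Haiku": date(2024, 3, 13),
    "Phi 4": date(2024, 12, 12),
}

# Assumption: average Gregorian month length, used only for rounding.
AVG_DAYS_PER_MONTH = 365.25 / 12


def months_between(earlier: date, later: date) -> int:
    """Return the gap between two dates, rounded to the nearest month."""
    return round((later - earlier).days / AVG_DAYS_PER_MONTH)


gap = months_between(RELEASES["Claude 3 Haiku"], RELEASES["Phi 4"])
print(f"Phi 4 is {gap} months newer")  # prints "Phi 4 is 9 months newer"
```

Counting only completed calendar months would give 8 instead, since December 12 falls one day short of the ninth month, so the rounding convention matters when comparing release gaps.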
Cost per million tokens (USD)
[Chart: Claude 3 Haiku vs. Phi 4]
Context window and performance specifications
Average performance across 6 common benchmarks
[Chart: Claude 3 Haiku vs. Phi 4]
Phi 4
2024-06-01
Available providers and their performance metrics

Claude 3 Haiku: Anthropic, Bedrock
Phi 4: DeepInfra