
Phi 4 Mini Reasoning
Zero-eval
#1 AIME
by Microsoft
About
Phi 4 Mini Reasoning is a language model developed by Microsoft and released in 2025. It averages 68.0% across 3 self-reported benchmarks: MATH-500 (94.6%), AIME (57.5%), and GPQA (52.0%). It is distributed under the MIT license, which permits commercial use.
Timeline
Announced: Apr 30, 2025
Released: Apr 30, 2025
Knowledge Cutoff: Feb 1, 2025
Specifications
Training Tokens: 150.0B
License & Family
License
MIT
Performance Overview
Performance metrics and category breakdown
Overall Performance
3 benchmarks
Average Score
68.0%
Best Score
94.6%
High Performers (80%+)
1
All Benchmark Results for Phi 4 Mini Reasoning
Complete list of benchmark scores with detailed information
Benchmark | Modality | Normalized Score | Score | Source
MATH-500 | text | 0.95 | 94.6% | Self-reported
AIME | text | 0.57 | 57.5% | Self-reported
GPQA | text | 0.52 | 52.0% | Self-reported
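
The summary figures in the Performance Overview can be reproduced from the three benchmark rows. The sketch below is only an illustration: it assumes the average is an unweighted mean of the percentage scores and that a "high performer" is any benchmark scoring 80% or higher. The page does not state either rule explicitly, and the variable names are hypothetical.

```python
# Scores copied from the benchmark table (self-reported, in percent).
scores = {"MATH-500": 94.6, "AIME": 57.5, "GPQA": 52.0}

# Assumption: the "Average Score" is a plain unweighted mean.
average = sum(scores.values()) / len(scores)

# Highest-scoring benchmark and its score.
best_name, best_score = max(scores.items(), key=lambda kv: kv[1])

# Assumption: "High Performers (80%+)" counts scores at or above 80%.
high_performers = [name for name, s in scores.items() if s >= 80.0]

print(f"Average score: {average:.1f}%")
print(f"Best score: {best_score:.1f}% ({best_name})")
print(f"High performers (80%+): {len(high_performers)}")
```

Under these assumptions the output agrees with the listed overview: an average of 68.0%, a best score of 94.6%, and one benchmark above the 80% threshold.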