Microsoft

Phi 4 Mini Reasoning

Zero-eval
#1AIME

by Microsoft

+
+
+
+
About

Phi 4 Mini Reasoning is a language model developed by Microsoft. It achieves strong performance with an average score of 68.0% across 3 benchmarks. It excels particularly in MATH-500 (94.6%), AIME (57.5%), GPQA (52.0%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Microsoft's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedApr 30, 2025
ReleasedApr 30, 2025
Knowledge CutoffFeb 1, 2025
+
+
+
+
Specifications
Training Tokens150.0B
+
+
+
+
License & Family
License
MIT
Performance Overview
Performance metrics and category breakdown

Overall Performance

3 benchmarks
Average Score
68.0%
Best Score
94.6%
High Performers (80%+)
1
+
+
+
+
All Benchmark Results for Phi 4 Mini Reasoning
Complete list of benchmark scores with detailed information
MATH-500
text
0.95
94.6%
Self-reported
AIME
text
0.57
57.5%
Self-reported
GPQA
text
0.52
52.0%
Self-reported