Microsoft

Phi 4 Mini

Zero-eval
#2Multilingual MMLU
#3OpenBookQA
#3Social IQa
+1 more

by Microsoft

+
+
+
+
About

Phi 4 Mini is a language model developed by Microsoft. It achieves strong performance with an average score of 65.4% across 17 benchmarks. It excels particularly in GSM8k (88.6%), ARC-C (83.7%), BoolQ (81.2%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents Microsoft's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedFeb 1, 2025
ReleasedFeb 1, 2025
Knowledge CutoffJun 1, 2024
+
+
+
+
Specifications
Training Tokens5.0T
+
+
+
+
License & Family
License
MIT
Performance Overview
Performance metrics and category breakdown

Overall Performance

17 benchmarks
Average Score
65.4%
Best Score
88.6%
High Performers (80%+)
3
+
+
+
+
All Benchmark Results for Phi 4 Mini
Complete list of benchmark scores with detailed information
GSM8k
text
0.89
88.6%
Self-reported
ARC-C
text
0.84
83.7%
Self-reported
BoolQ
text
0.81
81.2%
Self-reported
OpenBookQA
text
0.79
79.2%
Self-reported
PIQA
text
0.78
77.6%
Self-reported
Social IQa
text
0.72
72.5%
Self-reported
BIG-Bench Hard
text
0.70
70.4%
Self-reported
HellaSwag
text
0.69
69.1%
Self-reported
MMLU
text
0.67
67.3%
Self-reported
Winogrande
text
0.67
67.0%
Self-reported
Showing 1 to 10 of 17 benchmarks