OpenAI MMLU

text
+
+
+
+
About

OpenAI MMLU is OpenAI's implementation of the Massive Multitask Language Understanding benchmark, evaluating AI models across 57 academic subjects from elementary to professional levels. This comprehensive assessment tests knowledge spanning humanities, STEM, social sciences, and professional domains, serving as a standardized measure of general intelligence and academic competency in language models.

+
+
+
+
Evaluation Stats
Total Models2
Organizations1
Verified Results0
Self-Reported2
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

2 models
Top Score
35.6%
Average Score
28.9%
High Performers (80%+)
0

Top Organizations

#1Google
2 models
28.9%
+
+
+
+
Leaderboard
2 models ranked by performance on OpenAI MMLU
LicenseLinks
Jun 26, 2025
Proprietary
35.6%
Jun 26, 2025
Proprietary
22.3%
+
+
+
+
Resources