OpenAI MMLU

About

OpenAI MMLU is OpenAI's implementation of the Massive Multitask Language Understanding (MMLU) benchmark, which evaluates AI models across 57 academic subjects ranging from elementary to professional level. The subjects span the humanities, STEM, the social sciences, and professional domains, and the benchmark serves as a standardized measure of broad knowledge and academic competency in language models.

Evaluation Stats

Total Models: 2

Organizations: 1

Verified Results: 0

Self-Reported: 2

Benchmark Details

Max Score: 1

Language

Performance Overview

Score distribution and top performers

Score Distribution

2 models

Top Score

35.6%

Average Score

28.9%

High Performers (80%+)

0

Top Organizations

#1 Google

2 models

28.9%

Leaderboard

2 models ranked by performance on OpenAI MMLU

Rank	Model	Organization	Date	License	Score
#01	Gemma 3n E4B Instructed	Google	Jun 26, 2025	Proprietary	35.6%
#02	Gemma 3n E2B Instructed	Google	Jun 26, 2025	Proprietary	22.3%
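
The summary figures under Performance Overview can be checked against the two leaderboard rows. The snippet below is a minimal sketch, not the site's own code: the `summarize` helper and the `scores` dictionary are hypothetical names introduced for illustration, and it assumes scores are stored as fractions of the max score (1) and that "High Performers" counts models at or above 80%.

```python
from statistics import mean

# Self-reported scores copied from the leaderboard, as fractions of the max score (1).
scores = {
    "Gemma 3n E4B Instructed": 0.356,
    "Gemma 3n E2B Instructed": 0.223,
}


def summarize(scores, high_threshold=0.80):
    """Return top score, mean score, and the count of models at or above the threshold.

    `high_threshold` is an assumption based on the "High Performers (80%+)" label.
    """
    values = list(scores.values())
    return {
        "top": max(values),
        "average": mean(values),
        "high_performers": sum(v >= high_threshold for v in values),
    }


summary = summarize(scores)
print(f"Top score: {summary['top']:.1%}")
print(f"Average score: {summary['average']:.2%}")
print(f"High performers (80%+): {summary['high_performers']}")
```

The unrounded mean of the two scores is 28.95%, which the page displays as 28.9%; neither model reaches the 80% threshold.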
