CMMLU

Multilingual
text
+
+
+
+
About

CMMLU (Chinese Massive Multitask Language Understanding) is a comprehensive benchmark designed to evaluate AI models' performance on Chinese language tasks across multiple domains. Covering reading comprehension, text classification, knowledge reasoning, and specialized subjects, it provides extensive assessment of Chinese language capabilities. CMMLU addresses the significant gap in evaluating Chinese-specific language understanding and cultural knowledge in large language models.

+
+
+
+
Evaluation Stats
Total Models1
Organizations1
Verified Results0
Self-Reported1
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

1 models
Top Score
90.1%
Average Score
90.1%
High Performers (80%+)
1

Top Organizations

#1Alibaba Cloud / Qwen Team
1 model
90.1%
+
+
+
+
Leaderboard
1 models ranked by performance on CMMLU
LicenseLinks
Jul 23, 2024
tongyi-qianwen
90.1%
+
+
+
+
Resources