CMMLU

Multilingual

text

About

CMMLU (Chinese Massive Multitask Language Understanding) is a comprehensive benchmark designed to evaluate AI models' performance on Chinese language tasks across multiple domains. Covering reading comprehension, text classification, knowledge reasoning, and specialized subjects, it provides extensive assessment of Chinese language capabilities. CMMLU addresses the significant gap in evaluating Chinese-specific language understanding and cultural knowledge in large language models.

Evaluation Stats

Total Models1

Organizations1

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

Performance Overview

Score distribution and top performers

Score Distribution

1 models

Top Score

90.1%

Average Score

90.1%

High Performers (80%+)

Top Organizations

#1Alibaba Cloud / Qwen Team

1 model

90.1%

Leaderboard

1 models ranked by performance on CMMLU

			License		Links
#01Qwen2 72B Instruct	Alibaba Cloud / Qwen Team	Jul 23, 2024	tongyi-qianwen	90.1%

Resources

Research Paper