Alibaba Cloud / Qwen Team

Qwen2 7B Instruct

Zero-eval

by Alibaba Cloud / Qwen Team

+
+
+
+
About

Qwen2 7B Instruct is a language model developed by Alibaba Cloud / Qwen Team. The model shows competitive results across 14 benchmarks. It excels particularly in MT-Bench (84.1%), GSM8k (82.3%), HumanEval (79.9%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2024, it represents Alibaba Cloud / Qwen Team's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedJul 23, 2024
ReleasedJul 23, 2024
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown

Overall Performance

14 benchmarks
Average Score
59.5%
Best Score
84.1%
High Performers (80%+)
2
+
+
+
+
All Benchmark Results for Qwen2 7B Instruct
Complete list of benchmark scores with detailed information
MT-Bench
text
0.84
84.1%
Self-reported
GSM8k
text
0.82
82.3%
Self-reported
HumanEval
text
0.80
79.9%
Self-reported
C-Eval
text
0.77
77.2%
Self-reported
AlignBench
text
0.72
72.1%
Self-reported
MMLU
text
0.70
70.5%
Self-reported
EvalPlus
text
0.70
70.3%
Self-reported
MBPP
text
0.67
67.2%
Self-reported
MultiPL-E
text
0.59
59.1%
Self-reported
MATH
text
0.50
49.6%
Self-reported
Showing 1 to 10 of 14 benchmarks