
Qwen2 7B Instruct
Zero-eval
by Alibaba Cloud / Qwen Team
+
+
+
+
About
Qwen2 7B Instruct is a language model developed by Alibaba Cloud / Qwen Team. The model shows competitive results across 14 benchmarks. It excels particularly in MT-Bench (84.1%), GSM8k (82.3%), HumanEval (79.9%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2024, it represents Alibaba Cloud / Qwen Team's latest advancement in AI technology.
+
+
+
+
Timeline
AnnouncedJul 23, 2024
ReleasedJul 23, 2024
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance
14 benchmarks
Average Score
59.5%
Best Score
84.1%
High Performers (80%+)
2+
+
+
+
All Benchmark Results for Qwen2 7B Instruct
Complete list of benchmark scores with detailed information
MT-Bench | text | 0.84 | 84.1% | Self-reported | |
GSM8k | text | 0.82 | 82.3% | Self-reported | |
HumanEval | text | 0.80 | 79.9% | Self-reported | |
C-Eval | text | 0.77 | 77.2% | Self-reported | |
AlignBench | text | 0.72 | 72.1% | Self-reported | |
MMLU | text | 0.70 | 70.5% | Self-reported | |
EvalPlus | text | 0.70 | 70.3% | Self-reported | |
MBPP | text | 0.67 | 67.2% | Self-reported | |
MultiPL-E | text | 0.59 | 59.1% | Self-reported | |
MATH | text | 0.50 | 49.6% | Self-reported |
Showing 1 to 10 of 14 benchmarks