Alibaba Cloud / Qwen Team

QvQ-72B-Preview

Multimodal
Zero-eval
#1 OlympiadBench
#3 MathVision


About

QvQ-72B-Preview is a multimodal language model developed by Alibaba Cloud / Qwen Team and built on Qwen2-VL-72B-Instruct. Its results on 4 benchmarks are self-reported: MathVista (71.4%), MMMU (70.3%), MathVision (35.9%), and OlympiadBench (20.4%). As a multimodal model, it accepts both text and image inputs. It is distributed under the Qwen license and is listed as licensed for commercial use. It was announced and released on December 25, 2024.

Timeline
Announced: Dec 25, 2024
Released: Dec 25, 2024
Specifications
Capabilities: Multimodal
License & Family
License: Qwen
Base Model: Qwen2-VL-72B-Instruct

Performance Overview
Performance metrics and category breakdown

Overall Performance

Benchmarks: 4
Average Score: 49.5%
Best Score: 71.4%
High Performers (80%+): 0
All Benchmark Results for QvQ-72B-Preview
Complete list of benchmark scores with detailed information
Benchmark       Category     Score (0-1)   Score (%)   Source
MathVista       multimodal   0.71          71.4%       Self-reported
MMMU            multimodal   0.70          70.3%       Self-reported
MathVision      multimodal   0.36          35.9%       Self-reported
OlympiadBench   multimodal   0.20          20.4%       Self-reported
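
The summary figures in the Performance Overview (average score, best score, and the count of results at 80% or above) can be checked against the four self-reported scores listed here. The sketch below is a minimal illustration, assuming the average is a plain unweighted mean of the percentage scores; the variable names are hypothetical and not part of any published tooling.

```python
# Self-reported percentage scores for QvQ-72B-Preview, copied from the results list.
scores = {
    "MathVista": 71.4,
    "MMMU": 70.3,
    "MathVision": 35.9,
    "OlympiadBench": 20.4,
}

# Assumption: "Average Score" is the unweighted mean across all listed benchmarks.
average = sum(scores.values()) / len(scores)

# "Best Score" is the single highest percentage and the benchmark it belongs to.
best_name, best_score = max(scores.items(), key=lambda item: item[1])

# "High Performers (80%+)" counts benchmarks scoring at or above 80%.
high_performers = [name for name, value in scores.items() if value >= 80.0]

print(f"Benchmarks: {len(scores)}")
print(f"Average Score: {average:.1f}%")
print(f"Best Score: {best_score:.1f}% ({best_name})")
print(f"High Performers (80%+): {len(high_performers)}")
```

Under that assumption the computed values agree with the overview: a mean of 49.5%, a best score of 71.4% on MathVista, and no results at or above 80%.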