
QvQ-72B-Preview
Multimodal
Zero-eval
#1OlympiadBench
#3MathVision
by Alibaba Cloud / Qwen Team
+
+
+
+
About
QvQ-72B-Preview is a multimodal language model developed by Alibaba Cloud / Qwen Team. The model shows competitive results across 4 benchmarks. Notable strengths include MathVista (71.4%), MMMU (70.3%), MathVision (35.9%). As a multimodal model, it can process and understand text, images, and other input formats seamlessly. It's licensed for commercial use, making it suitable for enterprise applications. Released in 2024, it represents Alibaba Cloud / Qwen Team's latest advancement in AI technology.
+
+
+
+
Timeline
AnnouncedDec 25, 2024
ReleasedDec 25, 2024
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Qwen
Base ModelQwen2-VL-72B-Instruct
Performance Overview
Performance metrics and category breakdown
Overall Performance
4 benchmarks
Average Score
49.5%
Best Score
71.4%
High Performers (80%+)
0+
+
+
+
All Benchmark Results for QvQ-72B-Preview
Complete list of benchmark scores with detailed information
MathVista | multimodal | 0.71 | 71.4% | Self-reported | |
MMMU | multimodal | 0.70 | 70.3% | Self-reported | |
MathVision | multimodal | 0.36 | 35.9% | Self-reported | |
OlympiadBench | multimodal | 0.20 | 20.4% | Self-reported |