Qwen2.5 VL 72B Instruct
Multimodal
Zero-eval
#1 DocVQA
#1 Android Control Low_EM
#1 OCRBench
+24 more
by Alibaba Cloud / Qwen Team
About
Qwen2.5-VL 72B was created as the flagship vision-language model in the Qwen 2.5 series, designed to provide advanced multimodal understanding. Built with 72 billion parameters optimized for visual and textual reasoning, it represents Qwen's most capable offering for tasks requiring integrated image and language processing.
Timeline
Announced: Jan 26, 2025
Released: Jan 26, 2025
Specifications
Capabilities
Multimodal
License & Family
License
tongyi-qianwen
Performance Overview
Performance metrics and category breakdown
Overall Performance
30 benchmarks
Average Score
66.9%
Best Score
96.4%
High Performers (80%+)
8+
All Benchmark Results for Qwen2.5 VL 72B Instruct
Complete list of benchmark scores with detailed information
| Benchmark | Category | Raw Score | Percentage | Source |
|---|---|---|---|---|
| DocVQA | multimodal | 0.96 | 96.4% | Self-reported |
| Android Control Low_EM | multimodal | 0.94 | 93.7% | Self-reported |
| ChartQA | multimodal | 0.90 | 89.5% | Self-reported |
| OCRBench | multimodal | 0.89 | 88.5% | Self-reported |
| AI2D | multimodal | 0.88 | 88.4% | Self-reported |
| MMBench | multimodal | 0.88 | 88.0% | Self-reported |
| ScreenSpot | multimodal | 0.87 | 87.1% | Self-reported |
| AITZ_EM | multimodal | 0.83 | 83.2% | Self-reported |
| CC-OCR | multimodal | 0.80 | 79.8% | Self-reported |
| EgoSchema | video | 0.76 | 76.2% | Self-reported |
Showing 1 to 10 of 30 benchmarks
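The summary figures above can be reproduced from the table. A minimal sketch, using only the ten rows shown here (the card's 66.9% average is over all 30 benchmarks, so the mean below will differ):

```python
# Aggregate the ten benchmark rows visible in the table above.
# Scores are the "Percentage" column; the full card covers 30 benchmarks.
scores = {
    "DocVQA": 96.4,
    "Android Control Low_EM": 93.7,
    "ChartQA": 89.5,
    "OCRBench": 88.5,
    "AI2D": 88.4,
    "MMBench": 88.0,
    "ScreenSpot": 87.1,
    "AITZ_EM": 83.2,
    "CC-OCR": 79.8,
    "EgoSchema": 76.2,
}

# Mean over the visible rows only (not the card's 30-benchmark average).
average = sum(scores.values()) / len(scores)

# Benchmarks scoring 80% or higher, matching the "High Performers (80%+)" stat.
high_performers = [name for name, s in scores.items() if s >= 80.0]

print(f"Average over shown rows: {average:.2f}%")
print(f"Benchmarks at 80%+: {len(high_performers)}")
```

Eight of the ten visible rows clear 80%, consistent with the card's "High Performers (80%+): 8+" figure, and the best visible score (DocVQA, 96.4%) matches the reported Best Score.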