Qwen2-VL-72B-Instruct
Multimodal
Zero-eval
#1 DocVQAtest
#1 VCR_en_easy
#1 MMBench_test
+10 more
by Alibaba Cloud / Qwen Team
About
Qwen2-VL-72B is a large vision-language model designed for multimodal tasks that combine visual and textual understanding. With 72 billion parameters for integrated vision and language processing, it supports applications that require sophisticated analysis of images alongside text.
Timeline
Announced: Aug 29, 2024
Released: Aug 29, 2024
Knowledge Cutoff: Jun 30, 2023
Specifications
Capabilities
Multimodal
License & Family
License
tongyi-qianwen
Performance Overview
Performance metrics and category breakdown
Overall Performance
15 benchmarks
Average Score: 75.8%
Best Score: 96.5%
High Performers (80%+): 7+
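The summary figures above can be reproduced from the per-benchmark table below. A minimal sketch, assuming the ten scores listed on this page (the reported 75.8% average covers all 15 benchmarks, so the visible subset averages higher):

```python
# Scores for the 10 benchmarks shown on this page (out of 15 total).
scores = {
    "DocVQAtest": 96.5,
    "VCR_en_easy": 91.9,
    "ChartQA": 88.3,
    "OCRBench": 87.7,
    "MMBench_test": 86.5,
    "TextVQA": 85.5,
    "InfoVQAtest": 84.5,
    "EgoSchema": 77.9,
    "RealWorldQA": 77.8,
    "MMVetGPT4Turbo": 74.0,
}

# Aggregate statistics over the visible subset.
average = sum(scores.values()) / len(scores)        # mean of the 10 listed scores
best = max(scores.values())                         # best score, matches 96.5%
high_performers = sum(1 for s in scores.values() if s >= 80.0)  # 80%+ count

print(round(average, 2), best, high_performers)  # → 85.06 96.5 7
```

The "7+" high-performer count and the 96.5% best score match the visible rows; the 75.8% overall average depends on the five benchmarks not shown here.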
All Benchmark Results for Qwen2-VL-72B-Instruct
Complete list of benchmark scores with detailed information
| Benchmark | Category | Raw Score | Normalized | Source |
|---|---|---|---|---|
| DocVQAtest | multimodal | 0.96 | 96.5% | Self-reported |
| VCR_en_easy | multimodal | 0.92 | 91.9% | Self-reported |
| ChartQA | multimodal | 0.88 | 88.3% | Self-reported |
| OCRBench | multimodal | 0.88 | 87.7% | Self-reported |
| MMBench_test | multimodal | 0.86 | 86.5% | Self-reported |
| TextVQA | multimodal | 0.85 | 85.5% | Self-reported |
| InfoVQAtest | multimodal | 0.84 | 84.5% | Self-reported |
| EgoSchema | video | 0.78 | 77.9% | Self-reported |
| RealWorldQA | multimodal | 0.78 | 77.8% | Self-reported |
| MMVetGPT4Turbo | multimodal | 0.74 | 74.0% | Self-reported |
Showing 10 of 15 benchmarks.