Qwen2.5 VL 32B Instruct
Multimodal · Zero-eval
#1 on ScreenSpot, InfoVQA, Android Control High_EM, and 15 more benchmarks
by Alibaba Cloud / Qwen Team
About
Qwen2.5-VL 32B was developed as a mid-sized vision-language model that balances multimodal capability with practical deployment. With 32 billion parameters spanning its vision and language components, it targets applications that need strong visual understanding without flagship-scale compute.
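The checkpoint is distributed in the usual Hugging Face format under the repo name Qwen/Qwen2.5-VL-32B-Instruct. Below is a minimal inference sketch, assuming the transformers Qwen2_5_VLForConditionalGeneration integration and the qwen_vl_utils helper package published alongside the Qwen2.5-VL series; the image URL and prompt are placeholders, not part of this page.

```python
from transformers import AutoProcessor, Qwen2_5_VLForConditionalGeneration
from qwen_vl_utils import process_vision_info  # helper for extracting image/video inputs

MODEL_ID = "Qwen/Qwen2.5-VL-32B-Instruct"  # assumed Hugging Face repo name

# Load the weights and the matching processor (tokenizer + image preprocessor).
model = Qwen2_5_VLForConditionalGeneration.from_pretrained(
    MODEL_ID, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(MODEL_ID)

# One user turn containing an image plus a text instruction (placeholder content).
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": "https://example.com/document.png"},
            {"type": "text", "text": "Summarize the key figures in this document."},
        ],
    }
]

# Render the chat template and collect the vision inputs referenced in the messages.
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(
    text=[text], images=image_inputs, videos=video_inputs,
    padding=True, return_tensors="pt",
).to(model.device)

# Generate, strip the prompt tokens from each sequence, then decode the completion.
generated = model.generate(**inputs, max_new_tokens=256)
trimmed = [out[len(inp):] for inp, out in zip(inputs.input_ids, generated)]
print(processor.batch_decode(trimmed, skip_special_tokens=True)[0])
```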
Timeline
Announced: Feb 28, 2025
Released: Feb 28, 2025
Specifications
Capabilities
Multimodal
License & Family
License: Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance (28 benchmarks)
Average Score: 63.6%
Best Score: 94.8%
High Performers (80%+): 8+
All Benchmark Results for Qwen2.5 VL 32B Instruct
Complete list of benchmark scores with detailed information
| Benchmark | Category | Raw Score | Score (%) | Source |
|---|---|---|---|---|
| DocVQA | multimodal | 0.95 | 94.8% | Self-reported |
| Android Control Low_EM | multimodal | 0.93 | 93.3% | Self-reported |
| HumanEval | text | 0.92 | 91.5% | Self-reported |
| ScreenSpot | multimodal | 0.89 | 88.5% | Self-reported |
| MBPP | text | 0.84 | 84.0% | Self-reported |
| InfoVQA | multimodal | 0.83 | 83.4% | Self-reported |
| AITZ_EM | multimodal | 0.83 | 83.1% | Self-reported |
| MATH | text | 0.82 | 82.2% | Self-reported |
| MMLU | text | 0.78 | 78.4% | Self-reported |
| VideoMME w sub. | multimodal | 0.78 | 77.9% | Self-reported |
Showing 1 to 10 of 28 benchmarks
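The summary figures in the Performance Overview are plain aggregates of the per-benchmark percentages. The sketch below reproduces that arithmetic over the ten rows listed here; the page's own numbers (63.6% average, 94.8% best, 8+ high performers) are computed over all 28 benchmarks, so the average of this top-10 subset comes out higher.

```python
# Per-benchmark scores for the ten rows shown above (percent).
scores = {
    "DocVQA": 94.8,
    "Android Control Low_EM": 93.3,
    "HumanEval": 91.5,
    "ScreenSpot": 88.5,
    "MBPP": 84.0,
    "InfoVQA": 83.4,
    "AITZ_EM": 83.1,
    "MATH": 82.2,
    "MMLU": 78.4,
    "VideoMME w sub.": 77.9,
}

average = sum(scores.values()) / len(scores)              # subset average only
best = max(scores.values())                               # 94.8 (DocVQA)
high_performers = [n for n, s in scores.items() if s >= 80.0]  # 8 of the 10 shown

print(f"average over listed benchmarks: {average:.1f}%")
print(f"best score: {best:.1f}%")
print(f"high performers (80%+): {len(high_performers)}")
```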