Qwen2.5 VL 7B Instruct

Name: Qwen2.5 VL 7B Instruct
Rating: 64.5 (32 reviews)
Author: Alibaba Cloud / Qwen Team

Multimodal

Zero-eval

#1 MobileMiniWob++_SR

#1 MLVU

#1 LongVideoBench

About

Qwen2.5-VL 7B is an efficient vision-language model designed to provide multimodal understanding at low computational cost. With 7 billion parameters for joint visual and textual processing, it suits applications that need practical vision-language capabilities under constrained resources.

Timeline

Announced: Jan 26, 2025

Released: Jan 26, 2025

Specifications

Capabilities

Multimodal

License & Family

License: Apache 2.0

Performance Overview

Performance metrics and category breakdown

Benchmarks: 32

Average Score: 64.5%

Best Score: 95.7%

High Performers (80%+)

All Benchmark Results for Qwen2.5 VL 7B Instruct

Complete list of benchmark scores with detailed information

Resources