Qwen3-VL Flash

Multimodal

by Alibaba / Qwen

+
+
+
+
About

Qwen3-VL Flash is a lightweight multimodal variant from Alibaba's Qwen3-VL family, designed for efficient visual reasoning and image understanding at lower inference cost. It inherits the joint visual-textual architecture of the Qwen3-VL series and targets latency-sensitive applications requiring multimodal input processing at scale.

+
+
+
+
Timeline
ReleasedJan 22, 2026
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown

Overall Performance

1 benchmarks
Average Score
41.6%
Best Score
41.6%
High Performers (80%+)
0

Top Categories

Agents
41.6%
+
+
+
+
All Benchmark Results for Qwen3-VL Flash
Complete list of benchmark scores with detailed information
OSWorld
Agents
41.60
41.6%
Unverified
+
+
+
+
Resources