Qwen3-VL Flash
Multimodal
by Alibaba / Qwen
About
Qwen3-VL Flash is a lightweight multimodal variant from Alibaba's Qwen3-VL family, designed for efficient visual reasoning and image understanding at lower inference cost. It inherits the joint visual-textual architecture of the Qwen3-VL series and targets latency-sensitive applications requiring multimodal input processing at scale.
Timeline
Released: Jan 22, 2026
Specifications
Capabilities
Multimodal
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance
1 benchmark
Average Score
41.6%
Best Score
41.6%
High Performers (80%+)
0
Top Categories
Agents
41.6%
All Benchmark Results for Qwen3-VL Flash
Complete list of benchmark scores with detailed information
| Benchmark | Category | Score | Percentage | Verification |
| --- | --- | --- | --- | --- |
| OSWorld | Agents | 41.60 | 41.6% | Unverified |