Qwen3-VL Flash
Multimodal
by Alibaba / Qwen
+
+
+
+
About
Qwen3-VL Flash is a lightweight variant of the Qwen3-VL visual language model family, built for efficient multimodal inference where cost and latency matter more than maximum accuracy. It targets production deployments that require image understanding at scale without the inference overhead of the full Qwen3-VL-235B, serving as the accessible tier of Alibaba's multimodal model lineup.
+
+
+
+
Timeline
ReleasedJan 22, 2026
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance
1 benchmarks
Average Score
41.6%
Best Score
41.6%
High Performers (80%+)
0Top Categories
Agents
41.6%
+
+
+
+
All Benchmark Results for Qwen3-VL Flash
Complete list of benchmark scores with detailed information
| OSWorld | Agents | 41.60 | 41.6% | Unverified |