UI-TARS-2

Multimodal

by ByteDance

+
+
+
+
About

UI-TARS-2, released by ByteDance in September 2025, is a major generational upgrade of the UI-TARS family of GUI interaction models, with enhanced capabilities across computer control, game environments, code generation, and tool use. It targets agentic workflows requiring robust multimodal understanding of graphical interfaces across diverse application domains.

+
+
+
+
Timeline
ReleasedSep 4, 2025
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown

Overall Performance

1 benchmarks
Average Score
53.1%
Best Score
53.1%
High Performers (80%+)
0

Top Categories

Agents
53.1%
+
+
+
+
All Benchmark Results for UI-TARS-2
Complete list of benchmark scores with detailed information
OSWorld
Agents
53.10
53.1%
Unverified