+

Gemma 3 27B vs UI-TARS-72B-DPO

Comprehensive side-by-side LLM comparison

. Both models have their strengths depending on your specific coding needs.

+

Google DeepMind

Gemma 3 27B is a 27-billion-parameter open-weight model from Google DeepMind, released in March 2025 alongside the Gemma 3 12B as the higher-capability variant in the series, built with native vision-language support for text and image inputs across a 128K token context window. Among the Gemma 3 releases, the 27B delivered the strongest results on instruction-following and knowledge-intensive reasoning tasks, making it the preferred option for developers needing greater accuracy from a self-hostable model. Its open-weight availability under a permissive license made it a common starting point for vision-language fine-tuning projects.

+

ByteDance

UI-TARS-72B-DPO, released by ByteDance in early 2025, is a 72 billion parameter multimodal large language model from the UI-TARS family, built on Qwen-2-VL and fine-tuned for automated GUI interaction and computer control. It features native understanding of screenshots, UI elements, and web interfaces, achieving strong results across GUI benchmarks for perception, grounding, and agentic control. UI-TARS-72B-DPO targets computer-use agents, web automation, and applications requiring robust visual UI reasoning.

2 months newer

UI-TARS-72B-DPO

ByteDance

2025-01

Gemma 3 27B

Google DeepMind

2025-03-12

Provider Availability & Performance

Available providers and their performance metrics

+

Gemma 3 27B

0 providers

+

UI-TARS-72B-DPO

0 providers

+

Gemma 3 27B

Avg Score:0.0%

Providers:0

+

UI-TARS-72B-DPO

Avg Score:0.0%

Providers:0