Gemini 2.5 Flash-Lite vs Qwen2.5 72B Instruct: Complete Benchmarks, Speed & Cost Comparison (2026)

Gemini 2.5 Flash-Lite vs Qwen2.5 72B Instruct

Comprehensive side-by-side LLM comparison

Qwen2.5 72B Instruct leads with 3.1% higher average benchmark score. Gemini 2.5 Flash-Lite offers 974.8K more tokens in context window than Qwen2.5 72B Instruct. Both models have similar pricing. Gemini 2.5 Flash-Lite supports multimodal inputs. Qwen2.5 72B Instruct is available on 4 providers. Both models have their strengths depending on your specific coding needs.

Google

Gemini 2.5 Flash Lite was created as the most efficient option in the Gemini 2.5 family, designed to provide cutting-edge capabilities with minimal computational overhead. Built for applications where cost and latency are primary concerns, it extends advanced multimodal understanding to resource-conscious deployments.

Alibaba Cloud / Qwen Team

Qwen 2.5 72B was developed as the flagship text model in the Qwen 2.5 series, designed to provide advanced language capabilities with 72 billion parameters. Built to compete with frontier models in reasoning, coding, and general language tasks, it represents Qwen's most capable instruction-following model in this generation.

9 months newer

Qwen2.5 72B Instruct

Alibaba Cloud / Qwen Team

2024-09-19

Gemini 2.5 Flash-Lite

Google

2025-06-17

Pricing Comparison

Cost per million tokens (USD)

Gemini 2.5 Flash-Lite

Input:$0.10

Output:$0.40($0.25 cheaper)

Qwen2.5 72B Instruct

Input:$0.35

Output:$0.40

Performance Metrics

Context window and performance specifications

Average performance across 2 common benchmarks

Gemini 2.5 Flash-Lite

Average Score:49.2%

Qwen2.5 72B Instruct

Average Score:52.3%(+3.1%)

Knowledge Cutoff

Training data recency comparison

Gemini 2.5 Flash-Lite

2025-01-01

More recent knowledge cutoff means awareness of newer technologies and frameworks

Provider Availability & Performance

Available providers and their performance metrics

Gemini 2.5 Flash-Lite

1 providers

Google

Throughput: 5.69 tok/s

Latency: 0.44ms

Qwen2.5 72B Instruct

Gemini 2.5 Flash-Lite

Avg Score:49.2%

Providers:1

Qwen2.5 72B Instruct

Avg Score:52.3%(+3.1%)

Providers:4