Comprehensive side-by-side LLM comparison
Gemini 2.5 Pro leads with 6.1% higher average benchmark score. Gemini 2.5 Pro offers 860.7K more tokens in context window than Grok 3 mini. Grok 3 mini is $10.45 cheaper per million tokens. Gemini 2.5 Pro supports multimodal inputs. Overall, Gemini 2.5 Pro is the stronger choice for coding tasks.
Google DeepMind
Gemini 2.5 Pro, released by Google in May 2025, is a large language model from the Gemini 2.5 family designed for complex reasoning, coding, and long-context analysis tasks. It features a 1M token context window, native support for text, image, video, and audio input, and integrated thinking capabilities for multi-step problem solving. Gemini 2.5 Pro targets advanced coding workflows, scientific reasoning, and applications requiring deep understanding across large, mixed-modality contexts.
xAI
Grok 3 mini, released by xAI alongside Grok 3 in February 2025, is a compact reasoning model from the Grok 3 family featuring RL-enhanced Think mode for extended chain-of-thought processing. It features a 131K token context window and targets STEM tasks, mathematics, and coding applications where cost-efficient reasoning with configurable depth is required.
3 months newer

Grok 3 mini
xAI
2025-02-17

Gemini 2.5 Pro
Google DeepMind
2025-05-20
Cost per million tokens (USD)
Gemini 2.5 Pro
Grok 3 mini
Context window and performance specifications
Average performance across 1 common benchmarks
Gemini 2.5 Pro
Grok 3 mini
Performance comparison across key benchmark categories
Gemini 2.5 Pro
Grok 3 mini
Available providers and their performance metrics
Gemini 2.5 Pro
Google Cloud Vertex AI
Grok 3 mini
Gemini 2.5 Pro
Grok 3 mini
Gemini 2.5 Pro
Grok 3 mini
xAI