Gemini Diffusion
Zero-eval
#1LBPP (v2)
#1BigCodeBench
#3BIG-Bench Extra Hard
by Google
+
+
+
+
About
Gemini Diffusion was developed as a specialized model for image generation, designed to create high-quality visual content through diffusion-based techniques. Built to complement the text and multimodal capabilities of the Gemini family, it extends Google's AI capabilities into creative visual generation tasks.
+
+
+
+
Timeline
AnnouncedMay 20, 2025
ReleasedMay 20, 2025
+
+
+
+
License & Family
License
Proprietary
Performance Overview
Performance metrics and category breakdown
Overall Performance
10 benchmarks
Average Score
46.9%
Best Score
89.6%
High Performers (80%+)
1+
+
+
+
All Benchmark Results for Gemini Diffusion
Complete list of benchmark scores with detailed information
| HumanEval | text | 0.90 | 89.6% | Self-reported | |
| MBPP | text | 0.76 | 76.0% | Self-reported | |
| Global-MMLU-Lite | text | 0.69 | 69.1% | Self-reported | |
| LBPP (v2) | text | 0.57 | 56.8% | Self-reported | |
| BigCodeBench | text | 0.45 | 45.4% | Self-reported | |
| GPQA | text | 0.40 | 40.4% | Self-reported | |
| LiveCodeBench | text | 0.31 | 30.9% | Self-reported | |
| AIME 2025 | text | 0.23 | 23.3% | Self-reported | |
| SWE-Bench Verified | text | 0.23 | 22.9% | Self-reported | |
| BIG-Bench Extra Hard | text | 0.15 | 15.0% | Self-reported |
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+