Google Cloud Vertex AI

cloud.google.com
+
+
+
+
Platform Stats
Total Models13
Organizations2
Verified Benchmarks0
Multimodal Models13
+
+
+
+
Pricing Overview
Avg Input (per 1M)$3.46
Avg Output (per 1M)$18.15
Cheapest Model$0.10
Premium Model$15.00
+
+
+
+
Supported Features
Number of models supporting each feature
web Search
0
function Calling
13
structured Output
13
code Execution
0
batch Inference
4
finetuning
0
+
+
+
+
Input Modalities
Models supporting different input types
text
13 (100%)
image
13 (100%)
audio
4 (31%)
video
4 (31%)
+
+
+
+
Models Overview
Top performers and pricing distribution

Pricing Distribution

Input pricing per 1M tokens
$0-1
2 models
$1-5
8 models
$5-15
2 models
$15+
1 models

Top Performing Models

By benchmark avg
#1Claude Haiku 4.5
73.2%
#2Claude Opus 4.6
73.0%
#3Claude Sonnet 4.6
72.0%
#4Gemini 3 Pro
64.6%
#5Claude Opus 4.5
61.9%

Most Affordable Models

Gemini 3 Flash
$0.10/1M
Gemini 2.5 Flash
$0.15/1M
Claude Haiku 4.5
$1.00/1M

Available Models

13 models available through Google Cloud Vertex AI

LicenseLinks
Anthropic's Claude 4.5 Opus with 200K context, extended thinking, and advanced reasoning for agentic workflows
Nov 1, 2025
Proprietary
80.9%
Anthropic's Claude 4.6 model with adaptive thinking, 200K context, 128K output for complex agentic workflows
Feb 1, 2026
Proprietary
80.8%
Anthropic's Claude Sonnet 4.6 with adaptive thinking, vision, built-in web search, and code execution
Feb 17, 2026
Proprietary
79.6%
Google's Gemini 3 Flash with 1M context, multimodal support, and fast agentic coding capabilities
Dec 17, 2025
Proprietary
78.0%
Google's Gemini 3 Pro with 1M context, multimodal reasoning, and broad world knowledge for complex tasks
Nov 18, 2025
Proprietary
78.0%
Anthropic's Claude 4.5 Sonnet balancing speed and intelligence with 200K context, extended thinking, and vision
Sep 29, 2025
Proprietary
77.2%
Anthropic's Claude Opus 4.1 with extended thinking, 200K context, and strong reasoning for multi-step coding tasks
Aug 5, 2025
Proprietary
74.5%
Anthropic's Claude 4.5 Haiku optimized for low latency with 200K context, extended thinking, and vision support
Oct 1, 2025
Proprietary
73.3%
Google's Gemini 2.5 Flash with 1M context, multimodal input, and hybrid thinking at efficient cost
Jun 17, 2025
Proprietary
-
Google's Gemini 2.5 Pro with 1M context, multimodal reasoning, and strong coding performance
May 20, 2025
Proprietary
-
Showing 1 to 10 of 13 models
+
+
+
+
Resources