Google Cloud Vertex AI

cloud.google.com
Platform Stats
Total Models: 13
Organizations: 2
Verified Benchmarks: 0
Multimodal Models: 13
Pricing Overview
Avg Input (per 1M tokens): $3.46
Avg Output (per 1M tokens): $18.15
Cheapest Model: $0.10
Premium Model: $15.00
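
Prices quoted per 1M tokens convert to a per-request cost by scaling with the token counts. The following is a minimal sketch of that arithmetic, assuming the platform averages from the Pricing Overview as rates. The function name `estimate_cost` and the token counts are hypothetical, and actual rates vary by model.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_1m: float, output_price_per_1m: float) -> float:
    """Return the estimated cost in USD for a single request."""
    return (input_tokens * input_price_per_1m
            + output_tokens * output_price_per_1m) / 1_000_000

# Platform averages from the Pricing Overview, used only as illustrative rates.
AVG_INPUT_PER_1M = 3.46
AVG_OUTPUT_PER_1M = 18.15

# Hypothetical request: 2,000 input tokens and 500 output tokens.
cost = estimate_cost(2_000, 500, AVG_INPUT_PER_1M, AVG_OUTPUT_PER_1M)
print(f"${cost:.6f}")
```

Because output tokens are priced several times higher than input tokens on average, long responses tend to dominate the cost even when prompts are larger.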
Supported Features
Number of models supporting each feature
Web Search: 0
Function Calling: 13
Structured Output: 13
Code Execution: 0
Batch Inference: 4
Fine-tuning: 0
Input Modalities
Models supporting different input types
Text: 13 (100%)
Image: 13 (100%)
Audio: 4 (31%)
Video: 4 (31%)
Models Overview
Top performers and pricing distribution

Pricing Distribution

Input pricing per 1M tokens
$0-1: 2 models
$1-5: 8 models
$5-15: 2 models
$15+: 1 model

Top Performing Models

By benchmark average
#1 Claude Haiku 4.5: 73.3%
#2 Claude Sonnet 4.6: 72.0%
#3 Claude 3.7 Sonnet: 61.8%
#4 Gemini 3 Flash: 55.6%
#5 Gemini 3 Pro: 55.2%

Most Affordable Models

Gemini 3 Flash: $0.10/1M
Gemini 2.5 Flash: $0.15/1M
Claude Haiku 4.5: $1.00/1M

Available Models

13 models available through Google Cloud Vertex AI

Description | Released | License | Score
Anthropic's Claude 4.5 Opus with 200K context, extended thinking, and advanced reasoning for agentic workflows | Nov 24, 2025 | Proprietary | 80.9%
Anthropic's Claude 4.6 model with adaptive thinking, 200K context, 128K output for complex agentic workflows | Feb 5, 2026 | Proprietary | 80.8%
Anthropic's Claude 4 family model optimized for coding, computer use, and long-context reasoning with 200K context window | Feb 17, 2026 | Proprietary | 79.6%
Google's Gemini 3 Flash with 1M context, multimodal support, and fast agentic coding capabilities | Dec 17, 2025 | Proprietary | 78.0%
Google's Gemini 3 Pro with 1M context, multimodal reasoning, and broad world knowledge for complex tasks | Nov 18, 2025 | Proprietary | 76.2%
Anthropic's Claude Opus 4.1 with extended thinking, 200K context, and strong reasoning for multi-step coding tasks | Aug 5, 2025 | Proprietary | 74.5%
Anthropic's Claude 4.5 Haiku optimized for low latency with 200K context, extended thinking, and vision support | Oct 15, 2025 | Proprietary | 73.3%
Google's Gemini 2.5 Flash with 1M context, multimodal input, and hybrid thinking at efficient cost | Apr 17, 2025 | Proprietary | -
Google's Gemini 2.5 Pro with 1M context, multimodal reasoning, and strong coding performance | Mar 25, 2025 | Proprietary | -
Anthropic's Claude 4.5 Sonnet balancing speed and intelligence with 200K context, extended thinking, and vision | Sep 29, 2025 | Proprietary | -
Showing 1 to 10 of 13 models
Resources