Google Cloud Vertex AI

cloud.google.com

Platform Stats

Total Models13

Organizations2

Verified Benchmarks0

Multimodal Models13

Pricing Overview

Avg Input (per 1M)$3.46

Avg Output (per 1M)$18.15

Cheapest Model$0.10

Premium Model$15.00

Supported Features

Number of models supporting each feature

web Search

function Calling

structured Output

code Execution

batch Inference

finetuning

Input Modalities

Models supporting different input types

text

13 (100%)

image

13 (100%)

audio

4 (31%)

video

4 (31%)

Models Overview

Top performers and pricing distribution

Pricing Distribution

Input pricing per 1M tokens

$0-1

2 models

$1-5

8 models

$5-15

2 models

$15+

1 models

Top Performing Models

By benchmark avg

#1Claude Haiku 4.5

73.2%

#2Claude Opus 4.6

73.0%

#3Claude Sonnet 4.6

72.0%

#4Gemini 3 Pro

64.6%

#5Claude Opus 4.5

61.9%

Most Affordable Models

Gemini 3 Flash

$0.10/1M

Gemini 2.5 Flash

$0.15/1M

Claude Haiku 4.5

$1.00/1M

Available Models

13 models available through Google Cloud Vertex AI

			License
#01Claude Opus 4.5 Anthropic's Claude 4.5 Opus with 200K context, extended thinking, and advanced reasoning for agentic workflows	Anthropic	Nov 1, 2025	Proprietary	80.9%
#02Claude Opus 4.6 Anthropic's Claude 4.6 model with adaptive thinking, 200K context, 128K output for complex agentic workflows	Anthropic	Feb 1, 2026	Proprietary	80.8%
#03Claude Sonnet 4.6 Anthropic's Claude Sonnet 4.6 with adaptive thinking, vision, built-in web search, and code execution	Anthropic	Feb 17, 2026	Proprietary	79.6%
#04Gemini 3 Flash Google's Gemini 3 Flash with 1M context, multimodal support, and fast agentic coding capabilities	Google DeepMind	Dec 17, 2025	Proprietary	78.0%
#05Gemini 3 Pro Google's Gemini 3 Pro with 1M context, multimodal reasoning, and broad world knowledge for complex tasks	Google DeepMind	Nov 18, 2025	Proprietary	78.0%
#06Claude Sonnet 4.5 Anthropic's Claude 4.5 Sonnet balancing speed and intelligence with 200K context, extended thinking, and vision	Anthropic	Sep 29, 2025	Proprietary	77.2%
#07Claude Opus 4.1 Anthropic's Claude Opus 4.1 with extended thinking, 200K context, and strong reasoning for multi-step coding tasks	Anthropic	Aug 5, 2025	Proprietary	74.5%
#08Claude Haiku 4.5 Anthropic's Claude 4.5 Haiku optimized for low latency with 200K context, extended thinking, and vision support	Anthropic	Oct 1, 2025	Proprietary	73.3%
#09Gemini 2.5 Flash Google's Gemini 2.5 Flash with 1M context, multimodal input, and hybrid thinking at efficient cost	Google DeepMind	Jun 17, 2025	Proprietary	-
#10Gemini 2.5 Pro Google's Gemini 2.5 Pro with 1M context, multimodal reasoning, and strong coding performance	Google DeepMind	May 20, 2025	Proprietary	-

Showing 1 to 10 of 13 models

Resources

Official Website