xAI

Grok-3

Multimodal
Zero-eval
#3AIME 2024

by xAI

+
+
+
+
About

Grok 3 was introduced as xAI's third-generation flagship model, designed to push the boundaries of reasoning, factual accuracy, and helpful assistance. Built to advance the state of AI capabilities, it incorporates improvements across language understanding, generation, and analytical thinking.

+
+
+
+
Pricing Range
Input (per 1M)$3.00 -$3.00
Output (per 1M)$15.00 -$15.00
Providers1
+
+
+
+
Timeline
AnnouncedFeb 17, 2025
ReleasedFeb 17, 2025
Knowledge CutoffNov 17, 2024
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Proprietary
Performance Overview
Performance metrics and category breakdown

Overall Performance

5 benchmarks
Average Score
85.7%
Best Score
93.3%
High Performers (80%+)
3

Performance Metrics

Max Context Window
136.0K
Avg Throughput
100.0 tok/s
Avg Latency
1ms
+
+
+
+
All Benchmark Results for Grok-3
Complete list of benchmark scores with detailed information
AIME 2025
text
0.93
93.3%
Self-reported
AIME 2024
text
0.93
93.3%
Self-reported
GPQA
text
0.85
84.6%
Self-reported
LiveCodeBench
text
0.79
79.4%
Self-reported
MMMU
multimodal
0.78
78.0%
Self-reported
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
Resources