IBM

Granite 3.3 8B Base

Multimodal
Zero-eval
#1AttaQ
#1AlpacaEval 2.0
#1NQ
+2 more

by IBM

+
+
+
+
About

Granite 3.3 8B Base is a multimodal language model developed by IBM. It achieves strong performance with an average score of 64.3% across 20 benchmarks. It excels particularly in HumanEval (89.7%), AttaQ (88.5%), HumanEval+ (86.1%). As a multimodal model, it can process and understand text, images, and other input formats seamlessly. It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents IBM's latest advancement in AI technology.

+
+
+
+
Timeline
AnnouncedApr 16, 2025
ReleasedApr 16, 2025
Knowledge CutoffApr 1, 2024
+
+
+
+
Specifications
Capabilities
Multimodal
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown

Overall Performance

20 benchmarks
Average Score
64.3%
Best Score
89.7%
High Performers (80%+)
5
+
+
+
+
All Benchmark Results for Granite 3.3 8B Base
Complete list of benchmark scores with detailed information
HumanEval
text
0.90
89.7%
Self-reported
AttaQ
text
0.89
88.5%
Self-reported
HumanEval+
text
0.86
86.1%
Self-reported
AIME 2024
text
0.81
81.2%
Self-reported
HellaSwag
text
0.80
80.1%
Self-reported
TriviaQA
text
0.78
78.2%
Self-reported
IFEval
text
0.75
74.8%
Self-reported
Winogrande
text
0.74
74.4%
Self-reported
BIG-Bench Hard
text
0.69
69.1%
Self-reported
MATH-500
text
0.69
69.0%
Self-reported
Showing 1 to 10 of 20 benchmarks