AI2D

multimodal
About

AI2D is a benchmark for diagram understanding comprising 4,903 annotated diagrams and 4,563 questions. It evaluates visual reasoning over scientific illustrations through object segmentation, recognition of diagrammatic elements, and question answering. The dataset targets diagram comprehension for educational content, with relevance to computer vision and document understanding.

Evaluation Stats
Total Models: 17
Organizations: 9
Verified Results: 0
Self-Reported: 17
Benchmark Details
Max Score: 1
Language: en
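The maximum score is listed as 1, while the results on this page are displayed as percentages. The sketch below shows one plausible reading: the score is the fraction of questions answered correctly, scaled to a percentage for display. The exact scoring protocol is not stated here, so the function names `accuracy` and `as_percent` and the sample answers are hypothetical.

```python
def accuracy(predictions, answers):
    """Return the fraction of questions answered correctly, in [0, 1]."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("at least one question is required")
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


def as_percent(score, max_score=1.0):
    """Format a score on a 0..max_score scale as a one-decimal percentage."""
    return f"{100 * score / max_score:.1f}%"


# Hypothetical predicted and gold option indices for four questions.
predicted = [0, 2, 1, 3]
gold = [0, 2, 3, 3]

score = accuracy(predicted, gold)
print(score)              # 0.75
print(as_percent(score))  # 75.0%
```

Under this reading, a displayed value such as 94.7% corresponds to a raw score of 0.947.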
Performance Overview
Score distribution and top performers

Score Distribution

Models: 17
Top Score: 94.7%
Average Score: 85.6%
High Performers (80%+): 14

Top Organizations

#1 Anthropic (1 model): 94.7%
#2 OpenAI (1 model): 94.2%
#3 Mistral AI (2 models): 93.4%
#4 Meta (2 models): 91.7%
#5 xAI (1 model): 88.3%
Leaderboard
17 models ranked by performance on AI2D
Date | License | Score
Oct 22, 2024 | Proprietary | 94.7%
Aug 6, 2024 | Proprietary | 94.2%
Nov 18, 2024 | Mistral Research License (MRL) for research; Mistral Commercial License for commercial use | 93.8%
Jun 20, 2025 | Apache 2.0 | 92.9%
Sep 25, 2024 | Llama 3.2 | 92.3%
Sep 25, 2024 | Llama 3.2 Community License | 91.1%
Jan 26, 2025 | tongyi-qianwen | 88.4%
Apr 12, 2024 | Proprietary | 88.3%
Mar 12, 2025 | Gemma | 84.5%
Mar 12, 2025 | Gemma | 84.2%
Showing 1 to 10 of 17 models
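
The summary figures under Score Distribution (top score, average score, number of models at or above 80%) can be recomputed from a list of scores. The sketch below does this for only the ten scores listed above, so its average will not match the 85.6% reported for all 17 models. The function name `summarize` and its `threshold` parameter are illustrative assumptions, not part of the leaderboard.

```python
from statistics import mean

# The ten scores visible on this page, in percent (7 of the 17 models are not shown).
visible_scores = [94.7, 94.2, 93.8, 92.9, 92.3, 91.1, 88.4, 88.3, 84.5, 84.2]


def summarize(scores, threshold=80.0):
    """Compute leaderboard-style summary statistics for a list of percentage scores."""
    return {
        "models": len(scores),
        "top": max(scores),
        "average": round(mean(scores), 1),
        # Count scores at or above the threshold ("High Performers").
        "high_performers": sum(s >= threshold for s in scores),
    }


summary = summarize(visible_scores)
print(summary)
# {'models': 10, 'top': 94.7, 'average': 90.4, 'high_performers': 10}
```

The visible rows average 90.4%, above the overall 85.6%, which is consistent with the seven models not shown scoring lower than the ten that are.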