ScienceQA Visual

multimodal
+
+
+
+
About

ScienceQA-Visual is the visual component of the ScienceQA benchmark, specifically focusing on multimodal science questions that require both image understanding and scientific reasoning. This specialized evaluation tests AI models' ability to interpret scientific diagrams, charts, and visual data while applying scientific knowledge to answer complex questions requiring visual-textual integration.

+
+
+
+
Evaluation Stats
Total Models1
Organizations1
Verified Results0
Self-Reported1
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

1 models
Top Score
97.5%
Average Score
97.5%
High Performers (80%+)
1

Top Organizations

#1Microsoft
1 model
97.5%
+
+
+
+
Leaderboard
1 models ranked by performance on ScienceQA Visual
LicenseLinks
Feb 1, 2025
MIT
97.5%
+
+
+
+
Resources