OCRBench

multimodal
+
+
+
+
About

OCRBench is a comprehensive optical character recognition benchmark for evaluating text recognition capabilities of multimodal models. It tests models' ability to accurately recognize, extract, and understand text from images across various formats, fonts, languages, and visual contexts, providing systematic assessment of OCR performance and text-centric visual understanding.

+
+
+
+
Evaluation Stats
Total Models7
Organizations3
Verified Results0
Self-Reported7
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

7 models
Top Score
88.5%
Average Score
84.6%
High Performers (80%+)
7

Top Organizations

#1Alibaba Cloud / Qwen Team
3 models
87.5%
#2Microsoft
1 model
84.4%
#3DeepSeek
3 models
81.8%
+
+
+
+
Leaderboard
7 models ranked by performance on OCRBench
LicenseLinks
Jan 26, 2025
tongyi-qianwen
88.5%
Aug 29, 2024
tongyi-qianwen
87.7%
Jan 26, 2025
Apache 2.0
86.4%
Feb 1, 2025
MIT
84.4%
Dec 13, 2024
deepseek
83.4%
Dec 13, 2024
deepseek
81.1%
Dec 13, 2024
deepseek
80.9%
+
+
+
+
Resources