BrowseComp

Agents
+
+
+
+
About

BrowseComp evaluates AI agent web browsing and information-seeking persistence on 1,266 tasks requiring navigation of the live internet to find entangled, hard-to-locate information.

+
+
+
+
Evaluation Stats
Total Models23
Organizations5
Verified Results0
Self-Reported5
+
+
+
+
Benchmark Details
Max Score100
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

23 models
Top Score
84.0%
Average Score
20.3%
High Performers (80%+)
1

Top Organizations

#1Anthropic
6 models
45.4%
#2Google DeepMind
3 models
23.2%
#3OpenAI
9 models
12.9%
#4DeepSeek
2 models
2.2%
#5xAI
3 models
1.3%
+
+
+
+
Leaderboard
23 models ranked by performance on BrowseComp
LicenseLinks
Feb 1, 2026
Proprietary
84.0%
Dec 11, 2025
Proprietary
77.9%
Feb 17, 2026
Proprietary
74.7%
Nov 1, 2025
Proprietary
67.8%
Nov 18, 2025
Proprietary
59.2%
Sep 29, 2025
Proprietary
43.9%
Aug 7, 2025
Proprietary
20.1%
May 20, 2025
Proprietary
7.8%
Dec 5, 2024
Proprietary
6.3%
Apr 14, 2025
Proprietary
3.6%
Showing 1 to 10 of 23 models
+
+
+
+
Additional Metrics
Extended metrics for top models on BrowseComp