OJBench

text
+
+
+
+
About

OJBench is a competition-level code benchmark comprising 232 challenging programming problems designed to assess large language models' code reasoning abilities in competitive programming contexts. It evaluates models' capacity to solve complex algorithmic problems, implement efficient solutions, and demonstrate advanced programming skills at the level required for programming competitions.

+
+
+
+
Evaluation Stats
Total Models4
Organizations2
Verified Results0
Self-Reported4
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

4 models
Top Score
32.5%
Average Score
29.1%
High Performers (80%+)
0

Top Organizations

#1Alibaba Cloud / Qwen Team
2 models
31.1%
#2Moonshot AI
2 models
27.1%
+
+
+
+
Leaderboard
4 models ranked by performance on OJBench
LicenseLinks
Jul 25, 2025
Apache 2.0
32.5%
Sep 10, 2025
Apache 2.0
29.7%
Jul 11, 2025
MIT
27.1%
Sep 5, 2025
MIT
27.1%
+
+
+
+
Resources