CBNSL

text
+
+
+
+
About

CBNSL is a specialized benchmark for evaluating Large Language Models in building and coding scenarios, focusing on practical development workflows and code generation capabilities. The benchmark tests models' ability to handle complex coding tasks, agentic coding behaviors, and development-oriented problem-solving. CBNSL provides assessment metrics specifically designed for evaluating AI systems in real-world software development contexts and programming assistance scenarios.

+
+
+
+
Evaluation Stats
Total Models1
Organizations1
Verified Results0
Self-Reported1
+
+
+
+
Benchmark Details
Max Score1
Language
en
+
+
+
+
Performance Overview
Score distribution and top performers

Score Distribution

1 models
Top Score
95.6%
Average Score
95.6%
High Performers (80%+)
1

Top Organizations

#1Moonshot AI
1 model
95.6%
+
+
+
+
Leaderboard
1 models ranked by performance on CBNSL
LicenseLinks
Jul 11, 2025
MIT
95.6%
+
+
+
+
Resources