CBNSL
text
About
CBNSL is a benchmark for evaluating large language models in building and coding scenarios, with a focus on practical development workflows and code generation. It tests a model's ability to handle complex coding tasks, agentic coding behavior, and development-oriented problem solving. Its metrics are designed for assessing AI systems in real-world software development and programming assistance.
Evaluation Stats
Total Models: 1
Organizations: 1
Verified Results: 0
Self-Reported: 1
Benchmark Details
Max Score: 1
Language: en
Performance Overview
Score distribution and top performers
Score Distribution: 1 model
Top Score: 95.6%
Average Score: 95.6%
High Performers (80%+): 1
Top Organizations
#1 Moonshot AI: 1 model, 95.6%
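As an illustration of how the summary figures above relate to the underlying scores, the following is a minimal sketch, not the leaderboard's actual implementation. It assumes scores are stored on a 0 to 1 scale (consistent with the listed max score of 1) and displayed as percentages. The `entries` list and the `summarize` helper are hypothetical names introduced only for this example.

```python
from statistics import mean

# Hypothetical leaderboard entries: (organization, normalized score in [0, 1]).
# The single entry mirrors the one self-reported result listed on this page.
entries = [("Moonshot AI", 0.956)]

# Assumed cutoff for the "High Performers (80%+)" count.
HIGH_PERFORMER_THRESHOLD = 0.80


def summarize(entries, threshold=HIGH_PERFORMER_THRESHOLD):
    """Compute overview statistics from (organization, score) pairs."""
    scores = [score for _, score in entries]
    return {
        "total_models": len(scores),
        "top_score": max(scores),
        "average_score": mean(scores),
        # Count entries at or above the threshold.
        "high_performers": sum(score >= threshold for score in scores),
    }


summary = summarize(entries)
# Format the normalized scores as percentages with one decimal place.
print(f"Top score: {summary['top_score']:.1%}")
print(f"Average score: {summary['average_score']:.1%}")
print(f"High performers: {summary['high_performers']}")
```

With a single entry, the top score and the average coincide, which is why both are reported as the same value here.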
Leaderboard
1 model ranked by performance on CBNSL
Date | License | Score |
---|---|---|
Jul 11, 2025 | MIT | 95.6% |