CBNSL

text

About

CBNSL is a specialized benchmark for evaluating Large Language Models in building and coding scenarios, focusing on practical development workflows and code generation capabilities. The benchmark tests models' ability to handle complex coding tasks, agentic coding behaviors, and development-oriented problem-solving. CBNSL provides assessment metrics specifically designed for evaluating AI systems in real-world software development contexts and programming assistance scenarios.

Evaluation Stats

Total Models1

Organizations1

Verified Results0

Self-Reported1

Benchmark Details

Max Score1

Language

Performance Overview

Score distribution and top performers

Score Distribution

1 models

Top Score

95.6%

Average Score

95.6%

High Performers (80%+)

Top Organizations

#1Moonshot AI

1 model

95.6%

Leaderboard

1 models ranked by performance on CBNSL

			License		Links
#01Kimi K2 Instruct	Moonshot AI	Jul 11, 2025	MIT	95.6%

Resources

Research Paper