RepoBench

text

About

RepoBench is a benchmark for evaluating repository-level code auto-completion systems. It goes beyond single-file code completion: a model must follow cross-file dependencies, keep context across multiple files, and generate completions that fit the surrounding codebase.
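To make the idea of cross-file context concrete, here is a minimal sketch of how a repository-level completion prompt could be assembled and scored. It is not the benchmark's actual harness: the `Snippet` type, the `build_prompt` and `exact_match` helpers, the `# file:` comment convention, and the choice of exact match on the next line as the metric are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Snippet:
    """Hypothetical container for code taken from another file in the repository."""
    path: str
    code: str


def build_prompt(cross_file: list[Snippet], in_file_prefix: str) -> str:
    """Place cross-file snippets, each tagged with its path, before the in-file prefix."""
    parts = [f"# file: {s.path}\n{s.code.rstrip()}\n" for s in cross_file]
    parts.append(in_file_prefix)
    return "\n".join(parts)


def exact_match(prediction: str, reference: str) -> bool:
    """Compare the first non-empty predicted line with the reference line.

    Surrounding whitespace is ignored (an assumption of this sketch).
    """
    lines = [ln.strip() for ln in prediction.splitlines() if ln.strip()]
    first = lines[0] if lines else ""
    return first == reference.strip()


# Illustrative data only; not taken from the benchmark.
snippets = [Snippet("utils.py", "def add(a, b):\n    return a + b\n")]
prefix = "from utils import add\n\ndef total(a, b):\n"
prompt = build_prompt(snippets, prefix)
```

A model would receive `prompt` and be asked for the next line; `exact_match` then checks that line against the ground truth. The point of the sketch is that the completion `return add(a, b)` is only predictable if the model can see the definition that lives in `utils.py`.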

Evaluation Stats

Total Models: 1

Organizations: 1

Verified Results: 0

Self-Reported: 1

Benchmark Details

Max Score: 1

Language

Performance Overview

Score distribution and top performers

Score Distribution

1 model

Top Score

34.0%

Average Score

34.0%

High Performers (80%+)

Top Organizations

#1 Mistral AI

1 model

34.0%

Leaderboard

1 model ranked by performance on RepoBench

Rank	Model	Organization	Date	License	Score	Links
#01	Codestral-22B	Mistral AI	May 29, 2024	MNPL-0.1	34.0%	
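The summary figures on this page follow directly from the leaderboard rows. The sketch below shows one way to derive them; the `summarize` helper and its `threshold` parameter are hypothetical, and reading "High Performers (80%+)" as a score of at least 80 is an assumption.

```python
from statistics import mean

# (model, organization, score in percent), copied from the leaderboard row
rows = [("Codestral-22B", "Mistral AI", 34.0)]


def summarize(rows, threshold=80.0):
    """Derive page-level statistics from a non-empty list of leaderboard rows."""
    scores = [score for _, _, score in rows]
    return {
        "total_models": len(rows),
        "organizations": len({org for _, org, _ in rows}),
        "top_score": max(scores),
        "average_score": mean(scores),
        # Count of models at or above the assumed high-performer cutoff
        "high_performers": sum(score >= threshold for score in scores),
    }


summary = summarize(rows)
```

With a single entry, the top score and the average score are necessarily the same value, which is why both read 34.0% here.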

Resources

Research Paper