CRAG

text

About

CRAG (Comprehensive RAG Benchmark) is an evaluation framework designed to assess Retrieval-Augmented Generation systems' performance across diverse information retrieval and generation scenarios. This comprehensive benchmark tests AI models' ability to retrieve relevant information and generate accurate, contextually appropriate responses. CRAG provides thorough evaluation of RAG systems' effectiveness in real-world information synthesis and knowledge integration tasks.

Evaluation Stats

Total Models3

Organizations1

Verified Results0

Self-Reported3

Benchmark Details

Max Score1

Language

Performance Overview

Score distribution and top performers

Score Distribution

3 models

Top Score

50.3%

Average Score

45.7%

High Performers (80%+)

Top Organizations

#1Amazon

3 models

45.7%

Leaderboard

3 models ranked by performance on CRAG

			License
#01Nova Pro	Amazon	Nov 20, 2024	Proprietary	50.3%
#02Nova Lite	Amazon	Nov 20, 2024	Proprietary	43.8%
#03Nova Micro	Amazon	Nov 20, 2024	Proprietary	43.1%

Resources

Research Paper