DeepSeek R1 Zero

Name: DeepSeek R1 Zero
Rating: 76.5 (4 reviews)
Author: DeepSeek

About

DeepSeek-R1-Zero is an experimental variant trained with minimal human supervision, so that its reasoning patterns develop through self-guided reinforcement learning. It is a research model for studying how a model can discover analytical strategies on its own.

Timeline

Announced: Jan 20, 2025

Released: Jan 20, 2025

Specifications

Training Tokens: 14.8T

License & Family

License: MIT

Base Model: DeepSeek-V3

Performance Overview

Performance metrics and category breakdown

4 benchmarks

Average Score: 76.5%

Best Score: 95.9%

High Performers (80%+)

All Benchmark Results for DeepSeek R1 Zero

Complete list of benchmark scores with detailed information

Resources