DeepSeek R1 Distill Qwen 1.5B

Name: DeepSeek R1 Distill Qwen 1.5B
Rating: 46.8 (4 reviews)
Author: DeepSeek

Zero-eval

by DeepSeek

About

DeepSeek R1 Distill Qwen 1.5B is a language model developed by DeepSeek. The model shows competitive results across 4 benchmarks. It excels particularly in MATH-500 (83.9%), AIME 2024 (52.7%), GPQA (33.8%). It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents DeepSeek's latest advancement in AI technology.

Timeline

AnnouncedJan 20, 2025

ReleasedJan 20, 2025

Specifications

Training Tokens14.8T

License & Family

License

MIT

Performance Overview

Performance metrics and category breakdown

4 benchmarks

Average Score

46.8%

Best Score

83.9%

High Performers (80%+)

All Benchmark Results for DeepSeek R1 Distill Qwen 1.5B

Complete list of benchmark scores with detailed information

Resources