DeepSeek R1 Distill Llama 70B

Name: DeepSeek R1 Distill Llama 70B
Price: 0.1 USD
Rating: 76.0 (4 reviews)
Author: DeepSeek

Zero-eval

by DeepSeek

About

DeepSeek-R1-Distill-Llama-70B was created through knowledge distillation from DeepSeek-R1 into a Llama-based architecture, designed to transfer reasoning capabilities to a widely-used open-source foundation. Built to combine DeepSeek's reasoning innovations with Llama's ecosystem compatibility, it enables broader access to advanced reasoning techniques.

Pricing Range

Input (per 1M)$0.10 -$0.10

Output (per 1M)$0.40 -$0.40

Providers1

Timeline

AnnouncedJan 20, 2025

ReleasedJan 20, 2025

Specifications

Training Tokens14.8T

License & Family

License

MIT

Performance Overview

Performance metrics and category breakdown

4 benchmarks

Average Score

76.0%

Best Score

94.5%

High Performers (80%+)

Max Context Window

256.0K

Avg Throughput

37.0 tok/s

Avg Latency

1ms

All Benchmark Results for DeepSeek R1 Distill Llama 70B

Complete list of benchmark scores with detailed information

Resources