DeepSeek-V3

Name: DeepSeek-V3
Price: 0.27 USD per 1M input tokens
Average score: 67.2% (20 benchmarks)
Author: DeepSeek

Zero-eval

#1 DROP

#1 HumanEval-Mul

#1 Aider-Polyglot Edit

+5 more

About

DeepSeek-V3 is a mixture-of-experts model with 671B parameters, trained on 14.8 trillion tokens. It was built to run three times faster than DeepSeek-V2 while remaining open source, and it performs competitively against frontier closed-source models, marking a step forward in efficient large-scale model design.

Pricing Range

Input (per 1M tokens): $0.27 - $0.27

Output (per 1M tokens): $1.10 - $1.10

Providers: 1
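
To make the per-token rates concrete, here is a minimal sketch of a cost estimate based on the listed input and output prices. The function name and the example workload are hypothetical and chosen only for illustration; actual billing may differ by provider.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.27
OUTPUT_PRICE_PER_M = 1.10


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of a workload at the listed rates."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 2M input tokens and 500K output tokens.
print(f"${estimate_cost_usd(2_000_000, 500_000):.2f}")  # prints "$1.09"
```

Output tokens cost roughly four times as much as input tokens at these rates, so generation-heavy workloads dominate the bill.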

Timeline

Announced: Dec 25, 2024

Released: Dec 25, 2024

Specifications

Training Tokens: 14.8T

License & Family

License

MIT + Model License (Commercial use allowed)

Performance Overview

Performance metrics and category breakdown

20 benchmarks

Average Score

67.2%

Best Score

91.6%

High Performers (80%+)

Max Context Window

262.1K

Avg Throughput

100.0 tok/s

Avg Latency

1ms
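
As a rough illustration of what the throughput and latency figures imply, the sketch below estimates how long a response of a given length might take. It assumes the listed latency is a fixed delay before the first token and that throughput stays constant, neither of which the page confirms; the function name is hypothetical.

```python
# Averages listed on this page.
AVG_THROUGHPUT_TOK_PER_S = 100.0
AVG_LATENCY_S = 0.001  # 1 ms, assumed here to be the delay before the first token


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimate response time: fixed latency plus tokens divided by throughput."""
    return AVG_LATENCY_S + output_tokens / AVG_THROUGHPUT_TOK_PER_S


# Hypothetical 500-token response: about 5 seconds at the listed averages.
print(f"{estimate_generation_seconds(500):.3f} s")
```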

All Benchmark Results for DeepSeek-V3

Complete list of benchmark scores with detailed information

Resources