DeepSeek VL2

Name: DeepSeek VL2
Price: 9.5 USD
Rating: 70.9 (14 reviews)
Author: DeepSeek

Multimodal

Zero-eval

#1 MMT-Bench

#1 MME

#3 MMBench-V1.1


by DeepSeek

About

DeepSeek-VL2 is a vision-language model that accepts both visual and textual inputs for multimodal understanding tasks. It extends DeepSeek's models beyond text-only processing to applications that require joint analysis of images and language.

Pricing Range

Input (per 1M): $9.50 - $9.50

Output (per 1M): $4800.00 - $4800.00

Providers: 1
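
As a rough illustration of how per-1M-token rates translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost_usd` and the token counts are hypothetical, and the two rates are copied as-is from the listing above, so treat the result as illustrative rather than as a quote from any provider.

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      input_price_per_million: float,
                      output_price_per_million: float) -> float:
    """Estimate the cost in USD of one request from per-1M-token rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Each side is billed proportionally to its share of one million tokens.
    input_cost = input_tokens / 1_000_000 * input_price_per_million
    output_cost = output_tokens / 1_000_000 * output_price_per_million
    return input_cost + output_cost


# Rates copied as-is from the listing (USD per 1M tokens); not verified here.
LISTED_INPUT_PRICE = 9.50
LISTED_OUTPUT_PRICE = 4800.00

if __name__ == "__main__":
    # Hypothetical request: 2,000 input tokens and 500 output tokens.
    cost = estimate_cost_usd(2_000, 500, LISTED_INPUT_PRICE, LISTED_OUTPUT_PRICE)
    print(f"Estimated cost: ${cost:.4f}")
```

Passing the rates as parameters keeps the arithmetic reusable if a provider's prices differ from the ones listed.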

Timeline

Announced: Dec 13, 2024

Released: Dec 13, 2024

Specifications

Capabilities

Multimodal

License & Family

License

deepseek

Performance Overview

Performance metrics and category breakdown

14 benchmarks

Average Score

70.9%

Best Score

93.3%

High Performers (80%+)

Max Context Window

258.6K

Avg Throughput

22.0 tok/s

Avg Latency

1 ms

All Benchmark Results for DeepSeek VL2

Complete list of benchmark scores with detailed information


Resources