Grok-1.5V

Multimodal

Zero-eval

#3RealWorldQA

by xAI

About

Grok-1.5V is a multimodal language model developed by xAI. It achieves strong performance with an average score of 71.9% across 7 benchmarks. It excels particularly in AI2D (88.3%), DocVQA (85.6%), TextVQA (78.1%). As a multimodal model, it can process and understand text, images, and other input formats seamlessly. Released in 2024, it represents xAI's latest advancement in AI technology.

Timeline

AnnouncedApr 12, 2024

ReleasedApr 12, 2024

Specifications

Capabilities

Multimodal

License & Family

License

Proprietary

Performance Overview

Performance metrics and category breakdown

7 benchmarks

Average Score

71.9%

Best Score

88.3%

High Performers (80%+)

All Benchmark Results for Grok-1.5V

Complete list of benchmark scores with detailed information

Resources