Comprehensive side-by-side LLM comparison
Qwen2 72B Instruct leads Grok-1.5 with a 7.2% higher average benchmark score. Overall, Qwen2 72B Instruct is the stronger choice for coding tasks.
xAI
Grok-1.5 was developed by xAI as an advanced language model designed to provide helpful and accurate information, with a focus on reasoning and factual knowledge. Built as a conversational AI with strong analytical capabilities, it represents xAI's early generation of foundation models.
Alibaba Cloud / Qwen Team
Qwen2 72B was developed as the flagship model in the Qwen2 generation, designed to provide advanced language understanding with 72 billion parameters. Built to deliver strong performance across diverse tasks, it represented a significant advancement in Qwen's model capabilities when introduced.
Qwen2 72B Instruct is 3 months newer than Grok-1.5.

Grok-1.5
Developer: xAI
Release date: 2024-03-28

Qwen2 72B Instruct
Developer: Alibaba Cloud / Qwen Team
Release date: 2024-07-23
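
The "3 months newer" label can be checked against the two dates listed above. The following is a minimal sketch that assumes both dates are release dates; the variable names are illustrative and do not come from the page.

```python
from datetime import date

# Dates as listed for each model (assumed to be release dates).
grok_release = date(2024, 3, 28)
qwen_release = date(2024, 7, 23)

# Exact gap in days between the two dates.
gap_days = (qwen_release - grok_release).days

# Whole calendar months elapsed: count month boundaries, then subtract one
# if the later date has not yet reached the earlier date's day of the month.
full_months = (qwen_release.year - grok_release.year) * 12 + (
    qwen_release.month - grok_release.month
)
if qwen_release.day < grok_release.day:
    full_months -= 1

print(gap_days, full_months)
```

Counting only completed calendar months gives 3, which matches the label, although the gap of 117 days is closer to four months than to three.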
[Chart: average performance across 6 common benchmarks, Grok-1.5 vs. Qwen2 72B Instruct]
[Table: available providers and their performance metrics, Grok-1.5 vs. Qwen2 72B Instruct]