Comprehensive side-by-side LLM comparison
Claude 3.7 Sonnet leads with a 16.2% higher average benchmark score. Jamba 1.5 Large offers a context window 184.0K tokens larger than Claude 3.7 Sonnet's and is $8.00 cheaper per million tokens. Claude 3.7 Sonnet supports multimodal inputs and is available on 4 providers. Overall, Claude 3.7 Sonnet is the stronger choice for coding tasks.
Anthropic
Claude 3.7 Sonnet is a multimodal language model developed by Anthropic. It averages 74.1% across 11 benchmarks, with its strongest results on MATH-500 (96.2%), IFEval (93.2%), and MMMLU (86.1%). It supports a 328K-token context window for handling large documents and is available through 4 API providers. As a multimodal model, it can process text, images, and other input formats. Released in 2025, it represents Anthropic's latest advancement in AI technology.
AI21 Labs
Jamba 1.5 Large is a language model developed by AI21 Labs. It averages 65.5% across 8 benchmarks, with its strongest results on ARC-C (93.0%), GSM8k (87.0%), and MMLU (81.2%). It supports a 512K-token context window for handling large documents and is available through 2 API providers. Released in 2024, it represents AI21 Labs' latest advancement in AI technology.
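The derived figures in the summary can be rechecked from the numbers quoted in the two descriptions. The sketch below is illustrative only: the dictionary layout and function names are assumptions, not part of any published API. Note that the per-model averages cover different benchmark sets (11 and 8 benchmarks), so the relative gap computed here need not match the 16.2% quoted in the summary, which is presumably calculated over benchmarks the two models share.

```python
# Figures quoted in the model descriptions (context sizes in thousands of tokens).
# The dictionary layout and function names are illustrative assumptions.
claude = {"avg_score": 74.1, "context_k": 328}
jamba = {"avg_score": 65.5, "context_k": 512}


def context_gap_k(smaller, larger):
    """Difference in context window size, in thousands of tokens."""
    return larger["context_k"] - smaller["context_k"]


def score_gap(leader, other):
    """Return (gap in percentage points, gap relative to the other model in percent)."""
    points = leader["avg_score"] - other["avg_score"]
    relative = points / other["avg_score"] * 100
    return points, relative


gap_k = context_gap_k(claude, jamba)
points, relative = score_gap(claude, jamba)

print(f"Context window gap: {gap_k}K tokens")  # 184K tokens
print(f"Average score gap: {points:.1f} points ({relative:.1f}% relative)")  # 8.6 points (13.1% relative)
```

The 184K result agrees with the summary. The score gap is reported both in percentage points and as a relative percentage because the two are easy to confuse when reading comparison pages.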
Release dates
Jamba 1.5 Large (AI21 Labs): 2024-08-22
Claude 3.7 Sonnet (Anthropic): 2025-02-24
Claude 3.7 Sonnet is 6 months newer.
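The "6 months newer" label can be verified from the two release dates listed above. This is a minimal sketch using only the Python standard library; the helper name `months_between` is a hypothetical choice, and it counts whole calendar months.

```python
from datetime import date

# Release dates as listed on the page.
jamba_release = date(2024, 8, 22)
claude_release = date(2025, 2, 24)


def months_between(earlier, later):
    """Whole calendar months elapsed from `earlier` to `later`."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the final month is not yet complete
    return months


gap_months = months_between(jamba_release, claude_release)
gap_days = (claude_release - jamba_release).days

print(f"{gap_months} months ({gap_days} days)")  # 6 months (186 days)
```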
[Chart: cost per million tokens (USD), Claude 3.7 Sonnet vs. Jamba 1.5 Large]
Context window and performance specifications
[Chart: average performance across 18 common benchmarks, Claude 3.7 Sonnet vs. Jamba 1.5 Large]
Jamba 1.5 Large
2024-03-05
Available providers and their performance metrics
Claude 3.7 Sonnet: Anthropic, Bedrock, ZeroEval
Jamba 1.5 Large: Bedrock