
Qwen3 32B
Zero-eval
#1 MultiLF
#2 Arena Hard
by Alibaba Cloud / Qwen Team
About
Qwen3 32B is a language model developed by Alibaba Cloud / Qwen Team. It averages 72.0% across 9 self-reported benchmarks, with its highest scores on Arena Hard (93.8%), AIME 2024 (81.4%), and LiveBench (74.9%). It supports a 256K-token context window for handling large documents and is available through 3 API providers. It is licensed for commercial use, which makes it suitable for enterprise applications. It was released in 2025.
Pricing Range
Input (per 1M tokens): $0.10 - $0.40
Output (per 1M tokens): $0.30 - $0.80
Providers: 3
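As a rough illustration of how the per-1M-token price ranges above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost_range` and the example token counts are hypothetical; the prices are the ranges listed above, and actual provider billing may differ.

```python
# Price ranges in USD per 1M tokens, copied from the listing above.
INPUT_PRICE_RANGE = (0.10, 0.40)
OUTPUT_PRICE_RANGE = (0.30, 0.80)


def estimate_cost_range(input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (lowest, highest) estimated cost in USD for one request.

    Hypothetical helper: assumes simple linear per-token pricing with no
    minimum charge, caching discount, or rounding applied by the provider.
    """
    low = (input_tokens * INPUT_PRICE_RANGE[0]
           + output_tokens * OUTPUT_PRICE_RANGE[0]) / 1_000_000
    high = (input_tokens * INPUT_PRICE_RANGE[1]
            + output_tokens * OUTPUT_PRICE_RANGE[1]) / 1_000_000
    return low, high


# Example: a 20,000-token prompt with a 2,000-token completion.
low, high = estimate_cost_range(20_000, 2_000)
print(f"Estimated cost: ${low:.4f} - ${high:.4f}")
```

The spread between the two bounds reflects the difference between the cheapest and most expensive of the listed providers, so the choice of provider matters more as request volume grows.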
Timeline
Announced: Apr 29, 2025
Released: Apr 29, 2025
License & Family
License: Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance
Benchmarks: 9
Average Score: 72.0%
Best Score: 93.8%
High Performers (80%+): 2
Performance Metrics
Max Context Window: 256.0K
Avg Throughput: 129.0 tok/s
Avg Latency: 1ms
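The throughput and latency figures above can be combined into a back-of-the-envelope estimate of response time. This is only a sketch: it assumes the listed latency is a fixed delay before generation starts and that throughput stays constant, neither of which the page states. The function name `estimate_generation_seconds` is hypothetical.

```python
# Figures copied from the Performance Metrics above.
AVG_THROUGHPUT_TOK_PER_S = 129.0
AVG_LATENCY_S = 0.001  # 1 ms, assumed here to be a fixed startup delay


def estimate_generation_seconds(output_tokens: int) -> float:
    """Estimate wall-clock seconds to generate `output_tokens` tokens.

    Hypothetical model: fixed latency plus tokens divided by a constant
    average throughput. Real providers vary with load and prompt length.
    """
    return AVG_LATENCY_S + output_tokens / AVG_THROUGHPUT_TOK_PER_S


# Example: estimated time for a 1,000-token completion.
print(f"{estimate_generation_seconds(1_000):.2f} s")
```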
All Benchmark Results for Qwen3 32B
Complete list of benchmark scores with detailed information
Benchmark | Modality | Score (0-1) | Score (%) | Source
Arena Hard | text | 0.94 | 93.8% | Self-reported
AIME 2024 | text | 0.81 | 81.4% | Self-reported
LiveBench | text | 0.75 | 74.9% | Self-reported
MultiLF | text | 0.73 | 73.0% | Self-reported
AIME 2025 | text | 0.73 | 72.9% | Self-reported
BFCL | text | 0.70 | 70.3% | Self-reported
CodeForces | text | 0.66 | 65.9% | Self-reported
LiveCodeBench | text | 0.66 | 65.7% | Self-reported
Aider | text | 0.50 | 50.2% | Self-reported
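The summary figures reported in the Performance Overview (average score, best score, and number of benchmarks at 80% or above) can be reproduced from the table above. The sketch below assumes an unweighted mean over the nine percentage scores; the `summarize` helper is hypothetical and not part of any official tooling.

```python
# Percentage scores copied from the benchmark table above.
SCORES = {
    "Arena Hard": 93.8,
    "AIME 2024": 81.4,
    "LiveBench": 74.9,
    "MultiLF": 73.0,
    "AIME 2025": 72.9,
    "BFCL": 70.3,
    "CodeForces": 65.9,
    "LiveCodeBench": 65.7,
    "Aider": 50.2,
}


def summarize(scores: dict[str, float], threshold: float = 80.0) -> dict:
    """Compute the unweighted mean, best benchmark, and high performers."""
    average = sum(scores.values()) / len(scores)
    best_name = max(scores, key=scores.get)
    high = [name for name, score in scores.items() if score >= threshold]
    return {
        "average": round(average, 1),
        "best": (best_name, scores[best_name]),
        "high_performers": high,
    }


summary = summarize(SCORES)
print(summary["average"])          # unweighted mean, rounded to one decimal
print(summary["best"])             # highest-scoring benchmark
print(summary["high_performers"])  # benchmarks at or above the threshold
```

Because the mean is unweighted, a single low score such as Aider pulls the average well below the model's top results, which is worth keeping in mind when comparing averages across models evaluated on different benchmark sets.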