
GPT OSS 20B
Zero-eval
#2CodeForces
#2HealthBench
#2HealthBench Hard
by OpenAI
+
+
+
+
About
GPT OSS 20B is a language model developed by OpenAI. The model shows competitive results across 7 benchmarks. It excels particularly in MMLU (85.3%), CodeForces (83.9%), GPQA (71.5%). It supports a 262K token context window for handling large documents. The model is available through 4 API providers. It's licensed for commercial use, making it suitable for enterprise applications. Released in 2025, it represents OpenAI's latest advancement in AI technology.
+
+
+
+
Pricing Range
Input (per 1M)$0.05 -$0.10
Output (per 1M)$0.20 -$0.50
Providers4
+
+
+
+
Timeline
AnnouncedAug 5, 2025
ReleasedAug 5, 2025
+
+
+
+
License & Family
License
Apache 2.0
Performance Overview
Performance metrics and category breakdown
Overall Performance
7 benchmarks
Average Score
52.3%
Best Score
85.3%
High Performers (80%+)
2Performance Metrics
Max Context Window
262.1KAvg Throughput
705.0 tok/sAvg Latency
2ms+
+
+
+
All Benchmark Results for GPT OSS 20B
Complete list of benchmark scores with detailed information
MMLU | text | 0.85 | 85.3% | Self-reported | |
CodeForces | text | 0.84 | 83.9% | Self-reported | |
GPQA | text | 0.71 | 71.5% | Self-reported | |
TAU-bench Retail | text | 0.55 | 54.8% | Self-reported | |
HealthBench | text | 0.42 | 42.5% | Self-reported | |
Humanity's Last Exam | multimodal | 0.17 | 17.3% | Self-reported | |
HealthBench Hard | text | 0.11 | 10.8% | Self-reported |