
Llama-3.3 Nemotron Super 49B v1
Zero-eval
#1 MBPP
#2 MT-Bench
#3 BFCL v2
by NVIDIA
About
Llama-3.3 Nemotron Super 49B v1 is a language model developed by NVIDIA, announced and released on March 18, 2025. It averages 81.0% across 7 self-reported benchmarks, with its highest scores on MATH-500 (96.6%), MT-Bench (91.7%), and MBPP (91.3%).
Timeline
Announced: Mar 18, 2025
Released: Mar 18, 2025
Knowledge Cutoff: Dec 31, 2023
License & Family
License
Llama 3.1 Community License
Performance Overview
Performance metrics and category breakdown
Overall Performance
7 benchmarks
Average Score
81.0%
Best Score
96.6%
High Performers (80%+)
4
All Benchmark Results for Llama-3.3 Nemotron Super 49B v1
Complete list of benchmark scores with detailed information
Benchmark | Modality | Score (0-1) | Score (%) | Source
MATH-500 | text | 0.97 | 96.6% | Self-reported
MT-Bench | text | 0.92 | 91.7% | Self-reported
MBPP | text | 0.91 | 91.3% | Self-reported
Arena Hard | text | 0.88 | 88.3% | Self-reported
BFCL v2 | text | 0.74 | 73.7% | Self-reported
GPQA | text | 0.67 | 66.7% | Self-reported
AIME 2025 | text | 0.58 | 58.4% | Self-reported
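
The summary figures under Overall Performance (average score, best score, and the count of benchmarks at 80% or above) can be checked against the percentages listed above. The following is a minimal sketch, assuming the average is a plain unweighted mean of the seven scores; the variable names are illustrative and not taken from the site.

```python
# Benchmark percentages as listed in the results above (self-reported).
scores = {
    "MATH-500": 96.6,
    "MT-Bench": 91.7,
    "MBPP": 91.3,
    "Arena Hard": 88.3,
    "BFCL v2": 73.7,
    "GPQA": 66.7,
    "AIME 2025": 58.4,
}

# Assumption: the reported average is an unweighted mean over all benchmarks.
average = sum(scores.values()) / len(scores)

# Benchmark with the highest score.
best_name, best_score = max(scores.items(), key=lambda item: item[1])

# Benchmarks at or above the 80% "high performer" threshold.
high_performers = [name for name, score in scores.items() if score >= 80.0]

print(f"Benchmarks: {len(scores)}")
print(f"Average score: {average:.1f}%")
print(f"Best score: {best_score:.1f}% ({best_name})")
print(f"High performers (80%+): {len(high_performers)}")
```

Under that assumption, the computed values agree with the summary: an average that rounds to 81.0%, a best score of 96.6%, and 4 benchmarks at 80% or above.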