Llama 3.1 Nemotron Ultra 253B v1

Zero-eval
#2 on BFCL v2

by NVIDIA

About

Llama 3.1 Nemotron Ultra 253B v1 is a language model developed by NVIDIA and released on Apr 7, 2025. Its self-reported scores average 79.2% across 6 benchmarks, with the strongest results on MATH-500 (97.0%), IFEval (89.5%), and GPQA (76.0%).

Timeline
Announced: Apr 7, 2025
Released: Apr 7, 2025
Knowledge Cutoff: Dec 1, 2023
License & Family
License: Llama 3.1 Community License
Performance Overview
Performance metrics and category breakdown

Overall Performance

Benchmarks: 6
Average Score: 79.2%
Best Score: 97.0%
High Performers (80%+): 2
All Benchmark Results for Llama 3.1 Nemotron Ultra 253B v1
Complete list of benchmark scores with detailed information
Benchmark      Modality  Score (0-1)  Score (%)  Source
MATH-500       text      0.97         97.0%      Self-reported
IFEval         text      0.89         89.5%      Self-reported
GPQA           text      0.76         76.0%      Self-reported
BFCL v2        text      0.74         74.1%      Self-reported
AIME 2025      text      0.72         72.5%      Self-reported
LiveCodeBench  text      0.66         66.3%      Self-reported
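
The summary figures in the Performance Overview (benchmark count, average, best score, and number of high performers) can be reproduced from the six percentage scores listed above. The sketch below is only an illustration: the page does not say how the average is computed, so an unweighted mean is assumed, and "High Performers (80%+)" is assumed to mean scores greater than or equal to 80. The names `scores` and `summarize` are hypothetical and not part of any leaderboard API.

```python
# Self-reported scores (in percent) copied from the benchmark results above.
scores = {
    "MATH-500": 97.0,
    "IFEval": 89.5,
    "GPQA": 76.0,
    "BFCL v2": 74.1,
    "AIME 2025": 72.5,
    "LiveCodeBench": 66.3,
}


def summarize(scores, threshold=80.0):
    """Summarize a benchmark -> score mapping.

    Assumptions (not stated on the page): the average is an unweighted
    mean, and a "high performer" is any score >= threshold.
    """
    values = list(scores.values())
    return {
        "count": len(values),
        "average": round(sum(values) / len(values), 1),
        "best": max(values),
        "high_performers": sum(v >= threshold for v in values),
    }


summary = summarize(scores)
print(summary)
```

Under these assumptions the result matches the overview: 6 benchmarks, a 79.2% average, a 97.0% best score, and 2 benchmarks at or above 80%.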