
Llama 3.1 Nemotron 70B Instruct
Zero-eval
#1 GSM8K Chat
#1 MMLU Chat
#1 Instruct HumanEval
by NVIDIA
About
Llama 3.1 Nemotron 70B Instruct is a language model developed by NVIDIA and built on Llama 3.1 70B Instruct. It averages 67.9% across 11 benchmarks, with its strongest self-reported results on GSM8k (91.4%), HellaSwag (85.6%), and Winogrande (84.5%). It was released on Oct 1, 2024.
Timeline
Announced: Oct 1, 2024
Released: Oct 1, 2024
Knowledge Cutoff: Dec 1, 2023
License & Family
License: Llama 3.1 Community License
Base Model: Llama 3.1 70B Instruct
Performance Overview
Performance metrics and category breakdown
Overall Performance (11 benchmarks)
Average Score: 67.9%
Best Score: 91.4%
High Performers (80%+): 6+
All Benchmark Results for Llama 3.1 Nemotron 70B Instruct
Complete list of benchmark scores with detailed information
Benchmark | Modality | Score (0-1) | Score (%) | Source
GSM8k | text | 0.91 | 91.4% | Self-reported
HellaSwag | text | 0.86 | 85.6% | Self-reported
Winogrande | text | 0.85 | 84.5% | Self-reported
GSM8K Chat | text | 0.82 | 81.9% | Self-reported
MMLU Chat | text | 0.81 | 80.6% | Self-reported
MMLU | text | 0.80 | 80.2% | Self-reported
Instruct HumanEval | text | 0.74 | 73.8% | Self-reported
ARC-C | text | 0.69 | 69.2% | Self-reported
TruthfulQA | text | 0.59 | 58.6% | Self-reported
XLSum English | text | 0.32 | 31.6% | Self-reported
Showing 1 to 10 of 11 benchmarks
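
The summary figures can be checked against the ten listed rows with a short calculation. The sketch below is illustrative only: it assumes the reported average is an unweighted arithmetic mean over all 11 benchmarks, which the page does not state, and the helper names are hypothetical. Because the eleventh benchmark is not shown, the last value it prints is a back-of-the-envelope estimate, not a reported result.

```python
# Scores (%) copied from the ten rows listed above; the eleventh is not shown.
listed_scores = {
    "GSM8k": 91.4,
    "HellaSwag": 85.6,
    "Winogrande": 84.5,
    "GSM8K Chat": 81.9,
    "MMLU Chat": 80.6,
    "MMLU": 80.2,
    "Instruct HumanEval": 73.8,
    "ARC-C": 69.2,
    "TruthfulQA": 58.6,
    "XLSum English": 31.6,
}

REPORTED_AVERAGE = 67.9        # "Average Score" from the overview
REPORTED_BENCHMARK_COUNT = 11  # "11 benchmarks" from the overview


def mean_score(scores):
    """Unweighted arithmetic mean of the given scores."""
    return sum(scores.values()) / len(scores)


def high_performers(scores, threshold=80.0):
    """Names of benchmarks scoring at or above the threshold."""
    return [name for name, score in scores.items() if score >= threshold]


def implied_missing_score(scores, reported_average, total_count):
    """Score the single unlisted benchmark would need for the reported
    average to hold. ASSUMPTION: the average is an unweighted mean and
    exactly one benchmark is missing from `scores`."""
    if total_count - len(scores) != 1:
        raise ValueError("expected exactly one missing benchmark")
    return reported_average * total_count - sum(scores.values())


listed_mean = mean_score(listed_scores)
top = high_performers(listed_scores)
missing = implied_missing_score(
    listed_scores, REPORTED_AVERAGE, REPORTED_BENCHMARK_COUNT
)

print(f"Mean of the ten listed scores: {listed_mean:.2f}%")   # 73.74%
print(f"Listed benchmarks at 80%+: {len(top)}")               # 6
print(f"Implied eleventh score (estimate): {missing:.1f}%")   # 9.5%
```

The mean of the listed rows (73.74%) is higher than the reported 67.9% because the unlisted benchmark pulls the overall average down. The implied value is sensitive to rounding: the reported average is given to one decimal place, so the estimate can be off by half a percentage point or more.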