NVIDIA

NVIDIA

+
+
+
+
About

GPU and AI company

+
+
+
+
Portfolio Stats
Total Models5
Multimodal0
Benchmarks Run37
Avg Performance75.4%
+
+
+
+
Latest Release
Nemotron Nano 9B v2
Released: Aug 18, 2025
+
+
+
+
Release Timeline
Recent model releases by year
2025
4 models
2024
1 model
+
+
+
+
Performance Overview
Top models and benchmark performance

Benchmark Categories

Other
37
74.5%

Model Statistics

Multimodal Ratio
0%
Models with Providers
0

All Models

Complete portfolio of 5 models with advanced filtering

LicenseLinks
#01NVIDIANemotron Nano 9B v2
NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and tasks by first generating a reasoning trace and then concluding with a final response. The model's reasoning capabilities can be controlled via a system prompt. If the user prefers the model to provide its final answer without intermediate reasoning traces, it can be configured to do so, albeit with a slight decrease in accuracy for harder prompts that require reasoning. Conversely, allowing the model to generate reasoning traces first generally results in higher-quality final solutions to queries and tasks.
Aug 18, 2025
NVIDIA Open Model License Agreement
---71.1%-
#02NVIDIALlama 3.1 Nemotron Ultra 253B v1
A 253B parameter derivative of Meta Llama 3.1 405B Instruct, developed by NVIDIA using Neural Architecture Search (NAS) and vertical compression. It underwent multi-phase post-training (SFT for Math, Code, Reasoning, Chat, Tool Calling; RL with GRPO) to enhance reasoning and instruction-following. Optimized for accuracy/efficiency tradeoff on NVIDIA GPUs. Supports 128k context.
Apr 7, 2025
Llama 3.1 Community License
---66.3%-
#03NVIDIALlama 3.1 Nemotron Nano 8B V1
Llama-3.1-Nemotron-Nano-8B-v1 is a large language model (LLM) which is a derivative of Meta Llama-3.1-8B-Instruct (AKA the reference model). It is a reasoning model that is post trained for reasoning, human chat preferences, and tasks, such as RAG and tool calling.
Mar 18, 2025
Llama 3.1 Community License
----84.6%
#04NVIDIALlama-3.3 Nemotron Super 49B v1
Llama-3.3-Nemotron-Super-49B-v1 is a large language model (LLM) derived from Meta Llama-3.3-70B-Instruct. It's post-trained for reasoning, chat, RAG, and tool calling, offering a balance between accuracy and efficiency (optimized for single H100). It underwent multi-phase post-training including SFT and RL (RLOO, RPO).
Mar 18, 2025
Llama 3.1 Community License
----91.3%
#05NVIDIALlama 3.1 Nemotron 70B Instruct
A large language model customized by NVIDIA to improve the helpfulness of LLM generated responses. It is a fine-tuned version of Llama 3.1 70B Instruct. The model was trained using RLHF (REINFORCE) with HelpSteer2-Preference prompts.
Oct 1, 2024
Llama 3.1 Community License
-----
+
+
+
+
Resources