Llama 3.1 Nemotron Ultra 253B v1
NVIDIA-tuned 253B Llama 3.1 model optimized for enterprise applications and instruction following
Parameters
253 B
Context
128,000 tokens
Released
Nov 1, 2024
Leaderboards
Average score combining domain-specific Autobench scores (higher is better)
Performance vs. Industry Average
Context Window
Llama 3.1 Nemotron Ultra 253B v1 has a context window of 128k tokens, smaller than the average of 406k tokens.
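To put the comparison in proportion, the two figures quoted above can be turned into a ratio. This is only a rough sketch: the 406k average is taken as displayed on this page (presumably rounded), and the function and constant names are illustrative, not part of any leaderboard API.

```python
# Figures as quoted on this page; the 406k average is assumed to be rounded.
MODEL_CONTEXT_TOKENS = 128_000
AVERAGE_CONTEXT_TOKENS = 406_000


def relative_context(model_tokens: int, average_tokens: int) -> float:
    """Return the model's context window as a fraction of the average."""
    return model_tokens / average_tokens


ratio = relative_context(MODEL_CONTEXT_TOKENS, AVERAGE_CONTEXT_TOKENS)
print(f"{ratio:.1%} of the average context window")  # prints "31.5% of the average context window"
```

Under these figures the model's window is a little under one third of the average.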