Llama 3.1 Nemotron Ultra 253B v1

NVIDIA-tuned 253B Llama 3.1 model optimized for enterprise applications and instruction following

Parameters

253 B

Context

128,000 tokens

Released

Nov 1, 2024

Leaderboards

Average Score combining domain-specific Autobench scores; Higher is better

Llama 3.1 Nemotron Ultra 253B v1 has a smaller context window than average (406k tokens), with a context window of 128k tokens.