Llama 3.1 Nemotron Ultra 253B v1

NVIDIA-tuned 253B Llama 3.1 model optimized for enterprise applications and instruction following

Parameters: 253B
Context: 128,000 tokens
Released: Nov 1, 2024

Leaderboards

Performance vs. Industry Average

Context Window

Llama 3.1 Nemotron Ultra 253B v1 has a 128k-token context window, smaller than the industry average of 351k tokens.
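
The comparison above is simple arithmetic. Below is a minimal sketch, assuming the rounded figures quoted on this page (128,000 tokens for the model, roughly 351,000 tokens for the average); the variable names are illustrative and do not come from any API.

```python
# Figures as quoted on this page; the average is rounded to the nearest thousand.
model_context_tokens = 128_000
average_context_tokens = 351_000  # assumption: "351k" taken as exactly 351,000

# Fraction of the average context window that this model provides.
ratio = model_context_tokens / average_context_tokens

# Absolute gap between the average and this model, in tokens.
shortfall_tokens = average_context_tokens - model_context_tokens

print(f"Model context is {ratio:.1%} of the average")
print(f"Shortfall: {shortfall_tokens:,} tokens")
```

Under these assumptions the model offers a little over a third of the average context window.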

Llama 3.1 Nemotron Ultra 253B v1 - AutoBench