Back to Models
Llama 3.1 nemotron ultra 253b v1
Llama 3.1 Nemotron Ultra 253B is a derivative of Llama 3.1 405B, optimized for reasoning and chat. It offers a balance of accuracy and efficiency.
Thinking Mode
Parameters
253000000000 B
Context
131,072 tokens
Released
Jul 4, 2025
Leaderboards
Average Score combining domain-specific Autobench scores; Higher is better
Performance vs. Industry Average
Context Window
Llama 3.1 nemotron ultra 253b v1 has a smaller context window than average (406k tokens), with a context window of 131k tokens.