
Llama 3.1 Nemotron Ultra 253B v1

Llama 3.1 Nemotron Ultra 253B is a derivative of Llama 3.1 405B, optimized for reasoning and chat. It offers a balance of accuracy and efficiency.

Thinking Mode
Parameters: 253B
Context: 131,072 tokens
Released: Jul 4, 2025

Leaderboards

Performance vs. Industry Average

Context Window

Llama 3.1 Nemotron Ultra 253B v1 has a context window of 131k tokens, smaller than the industry average of 351k tokens.
