Back to Models

Llama 3.3 Nemotron Super 49B v1

NVIDIA optimized 49B Llama 3.3 model providing excellent performance-to-size ratio

Parameters
490 B
Context
128,000 tokens
Released
Nov 22, 2024

Leaderboards

Performance vs. Industry Average

Context Window

Llama 3.3 Nemotron Super 49B v1 has a smaller context window than average (351k tokens), with a context window of 128k tokens.

Llama 3.3 Nemotron Super 49B v1 - AutoBench