Back to Models

Llama 4 Scout 17B 16E Instruct

Llama 4 Scout variant with 17B parameters and mixture-of-experts architecture for efficiency

Parameters
170 B
Context
128,000 tokens
Released
Apr 1, 2025

Leaderboards

Performance vs. Industry Average

Intelligence

Llama 4 Scout 17B 16E Instruct is of lower intelligence compared to average (4.1), with an intelligence score of 3.6.

Price

Llama 4 Scout 17B 16E Instruct is cheaper compared to average ($0.91 per 1M Tokens) with a price of $0.04 per 1M Tokens.

Latency

Llama 4 Scout 17B 16E Instruct has a lower average latency compared to average (45.24s), with an average latency of 10.87s.

P99 Latency

Llama 4 Scout 17B 16E Instruct has a lower P99 latency compared to average (172.60s), taking 39.62s to receive the first token at P99 (TTFT).

Context Window

Llama 4 Scout 17B 16E Instruct has a smaller context window than average (246k tokens), with a context window of 128k tokens.

Llama 4 Scout 17B 16E Instruct - AutoBench