Back to Models

Gemma 4 26B A4B IT

Gemma 4 26B A4B IT is a latency-optimized open-weights MoE model activating only 3.8B parameters per token. It delivers near-31B dense quality while preserving hardware constraints for edge and enterprise deployments.

Thinking Mode
Parameters
26000000000 B
Context
262,144 tokens
Released
Invalid Date

Leaderboards

Performance vs. Industry Average

Intelligence

Gemma 4 26B A4B IT is of lower intelligence compared to average (2.8), with an intelligence score of 2.5.

Price

Gemma 4 26B A4B IT is cheaper compared to average ($0.67 per 1M Tokens) with a price of $0.02 per 1M Tokens.

Latency

Gemma 4 26B A4B IT has a lower average latency compared to average (45.95s), with an average latency of 13.64s.

P99 Latency

Gemma 4 26B A4B IT has a lower P99 latency compared to average (131.50s), taking 43.97s to receive the first token at P99 (TTFT).

Context Window

Gemma 4 26B A4B IT has a smaller context window than average (401k tokens), with a context window of 262k tokens.

Gemma 4 26B A4B IT - AutoBench