Qwen 3 30B-A3B

MoE Qwen 3 model with 30B total parameters, activating 3B for efficient inference

Parameters
30 B
Context
32,768 tokens
Released
Jan 20, 2025

Performance vs. Industry Average

Intelligence

Qwen 3 30B-A3B scores slightly below the industry average on intelligence, with a score of 4.0 against an average of 4.1.

Price

Qwen 3 30B-A3B is cheaper than the industry average, at $0.08 per 1M tokens against an average of $0.91 per 1M tokens.

Latency

Qwen 3 30B-A3B is slower than the industry average, with an average latency of 72.64s against an industry average of 45.24s.

P99 Latency

Qwen 3 30B-A3B has a higher P99 latency than the industry average: at the 99th percentile it takes 243.07s to return the first token (TTFT), against an average of 172.60s.

Context Window

Qwen 3 30B-A3B has a smaller context window than the industry average, at 32,768 tokens (about 33k) against an average of about 246k tokens.
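
To put the comparisons above on a common scale, each metric can be expressed as a multiple of the industry average. The following is a minimal sketch in Python, not part of the leaderboard itself. The dictionary keys and the helper `ratio_to_average` are hypothetical names chosen for illustration, and the average context window uses the rounded 246k figure, so that ratio is approximate.

```python
# Figures copied from the comparison text: (model value, industry average).
# Key names are illustrative and do not come from the leaderboard.
metrics = {
    "intelligence": (4.0, 4.1),
    "price_usd_per_1m_tokens": (0.08, 0.91),
    "avg_latency_s": (72.64, 45.24),
    "p99_ttft_s": (243.07, 172.60),
    "context_tokens": (32_768, 246_000),  # average is rounded to 246k
}


def ratio_to_average(model_value: float, average_value: float) -> float:
    """Return the model's value as a multiple of the industry average."""
    return model_value / average_value


# One ratio per metric; values below 1.0 mean "below average".
ratios = {name: ratio_to_average(m, a) for name, (m, a) in metrics.items()}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x average")
```

Read this way, the price is under a tenth of the average, while average latency is roughly 1.6 times the average and P99 TTFT roughly 1.4 times. Whether a ratio below 1.0 is good depends on the metric: lower is better for price and latency, higher is better for intelligence and context window.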

Qwen 3 30B-A3B - AutoBench