Back to Models

Gemini 2.5 flash

Gemini 2.5 Flash is Google's workhorse model for high-frequency tasks. It features a 1M context window, optimized for speed and efficiency in reasoning and multimodal processing.

Thinking Mode
Parameters
N/A
Context
1,048,576 tokens
Released
Invalid Date

Leaderboards

Performance vs. Industry Average

Intelligence

Gemini 2.5 flash is of higher intelligence compared to average (4.1), with an intelligence score of 4.2.

Price

Gemini 2.5 flash is cheaper compared to average ($4.58 per 1M Tokens) with a price of $2.12 per 1M Tokens.

Latency

Gemini 2.5 flash has a lower average latency compared to average (116.45s), with an average latency of 65.62s.

P99 Latency

Gemini 2.5 flash has a lower P99 latency compared to average (339.37s), taking 173.94s to receive the first token at P99 (TTFT).

Context Window

Gemini 2.5 flash has a larger context window than average (351k tokens), with a context window of 1049k tokens.

Gemini 2.5 flash - AutoBench