
Gemini 2.5 Flash-Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model optimized for ultra-low latency. It offers a 1M context window and is designed for cost-effective, high-throughput applications.

Thinking Mode
Parameters: N/A
Context: 1,000,000 tokens

Leaderboards

Performance vs. Industry Average

Intelligence

Gemini 2.5 Flash-Lite scores below average on intelligence: 3.9 versus an average of 4.1.

Price

Gemini 2.5 Flash-Lite is cheaper than average, at $0.21 per 1M tokens versus an average of $4.58 per 1M tokens.

Latency

Gemini 2.5 Flash-Lite has lower latency than average, with an average latency of 20.42s versus 116.45s.

P99 Latency

Gemini 2.5 Flash-Lite has a lower P99 latency than average: at P99 it takes 69.09s to receive the first token (TTFT), versus an average of 339.37s.

Context Window

Gemini 2.5 Flash-Lite has a larger context window than average, at 1,000k tokens (1M) versus an average of 351k tokens.
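
To put the comparisons above on a common scale, each figure can be expressed as a ratio to the reported average. The following is a minimal sketch using only the numbers quoted in this section; the dictionary layout and the `ratio_to_average` helper are illustrative assumptions, not part of any benchmark tooling.

```python
# Figures copied from the comparisons above as (model value, reported average).
# The key names and this structure are hypothetical, chosen for illustration.
METRICS = {
    "intelligence": (3.9, 4.1),
    "price_usd_per_1m_tokens": (0.21, 4.58),
    "avg_latency_s": (20.42, 116.45),
    "p99_latency_s": (69.09, 339.37),
    "context_k_tokens": (1000, 351),
}


def ratio_to_average(model_value, average_value):
    """Return the model's value as a multiple of the reported average."""
    return model_value / average_value


# Compute one ratio per metric.
ratios = {
    name: ratio_to_average(model, average)
    for name, (model, average) in METRICS.items()
}

for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x the average")
```

A ratio below 1 is favorable for price and latency, while a ratio above 1 is favorable for intelligence and context window. On these figures the price is roughly 0.05x the average and the context window roughly 2.85x.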

Gemini 2.5 flash lite - AutoBench