Gemini 2.5 flash lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model optimized for ultra-low latency. It offers a 1M context window and is designed for cost-effective, high-throughput applications.

Thinking Mode

Parameters

N/A

Context

1,000,000 tokens

Released

Invalid Date

Leaderboards

Average Score combining domain-specific Autobench scores; Higher is better

Performance vs. Industry Average

Context Window

Gemini 2.5 flash lite has a larger context window than average (406k tokens), with a context window of 1000k tokens.