Gemini 3.1 Flash Lite

Name: Gemini 3.1 Flash Lite
Brand: Google
Rating: 3.5 (424 reviews)

budget

Google • Released on 2026-03-03

Google's fastest and most cost-efficient Gemini 3 series model. It delivers 2.5x faster time to first token (TTFT) and 45% faster output than Gemini 2.5 Flash. It is designed for high-volume workloads, including translation, content moderation, UI generation, and simulations, and it supports adjustable thinking levels.

Core Specs

Context Window: 1000K tokens
Max Output: not specified
Reasoning: ✓
Open Source: ✗
Multimodal Support: text, image

Pros & Cons

Pros

+ Cheapest Gemini 3 model ($0.25 input / $1.50 output per 1M tokens)
+ 2.5x faster TTFT than 2.5 Flash
+ 45% faster output speed
+ Adjustable thinking levels
+ Great for high-volume tasks
+ Free tier via AI Studio

Cons

− New model, limited community feedback
− Not optimized for complex coding tasks
− Preview stage only

Reliability

SLA: 99.5%
Incidents (30d): 0
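
To put the SLA figure in concrete terms, the short Python sketch below converts an uptime percentage into a downtime budget. The listing does not say how the SLA is measured, so the 30-day window and the helper name `allowed_downtime_hours` are assumptions made for illustration only.

```python
def allowed_downtime_hours(sla_percent: float, window_days: int = 30) -> float:
    """Hours of downtime that still satisfy the SLA over the window.

    Assumption: the SLA is a simple uptime fraction over a fixed window.
    """
    return (1 - sla_percent / 100) * window_days * 24


# 99.5% uptime over an assumed 30-day window
print(round(allowed_downtime_hours(99.5), 2))  # 3.6
```

Under that assumption, a 99.5% SLA tolerates roughly 3.6 hours of downtime per 30 days.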

Pricing

Input (per 1M tokens): $0.25
Output (per 1M tokens): $1.50

Free trial available
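
As a rough illustration of what these list prices mean for a high-volume workload, the Python sketch below estimates cost from token counts. The prices come from the listing; the workload figures (10,000 requests at 2,000 input and 500 output tokens each) and the function name `estimate_cost` are hypothetical, and the sketch ignores any free-tier allowance or discounts.

```python
# List prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 1.50


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost at list prices, with no free tier or discounts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 10,000 requests, 2,000 input and 500 output tokens each.
requests = 10_000
cost = estimate_cost(requests * 2_000, requests * 500)
print(f"${cost:.2f}")  # $12.50
```

In this example the output side accounts for $7.50 of the $12.50 total even though it is a quarter of the input volume, because output tokens are priced six times higher.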

Benchmarks

GPQA Diamond: 86.9%
MMMU-Pro: 76.8%
Arena Elo: 1432