Gemini 3.1 Flash Lite

budget

GoogleReleased on 2026-03-03

Google's fastest and most cost-efficient Gemini 3 series model. 2.5X faster Time to First Token and 45% faster output than 2.5 Flash. Designed for high-volume workloads including translation, content moderation, UI generation, and simulations. Supports adjustable thinking levels.

71
Overall Score

Core Specs

1000K
Context Window
NaNK
Max Output
Reasoning
Open Source
Multimodal Support
textimage

User Feedback Highlights

Based on community feedback. Hover to see original reviews.

+ Great for high-volume tasks+ Cheapest Gemini 3 model ($0.25/$1.50)+ 2.5X faster TTFT than 2.5 Flash+ 45% faster output speed+ Free tier via AI Studio Preview stage only New model, limited community feedback Not optimized for complex coding tasks+ Adjustable thinking levels
Sentiment:👍 0%😐 0%👎 0%

Pros & Cons

Pros

  • +Cheapest Gemini 3 model ($0.25/$1.50)
  • +2.5X faster TTFT than 2.5 Flash
  • +45% faster output speed
  • +Adjustable thinking levels
  • +Great for high-volume tasks
  • +Free tier via AI Studio

Cons

  • New model, limited community feedback
  • Not optimized for complex coding tasks
  • Preview stage only

Reliability

SLA99.5%
Incidents (30d)0
View Status Page →

Pricing

Input (per 1M tokens)$0.25
Output (per 1M tokens)$1.50
Free trial available
Updated on

Benchmarks

gpqaDiamond86.9%
mmmuPro76.8%
arenaElo1432%