← All models
Google: Gemini 3.1 Flash Lite
google/gemini-3.1-flash-lite
VisionTool useJSONReasoningStreaming
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
Pricing
Input
$0.25 / 1M
Output
$1.50 / 1M
Specs
Context
1,048,576 tokens
Input
text, image, video, file, audio
Output
text
Released: 2026-05
Supported parameters
include_reasoningmax_tokensreasoningresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_p