Google: Gemini 3.1 Flash Lite

google/gemini-3.1-flash-lite

Vision, Tool use, JSON, Reasoning, Streaming

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Pricing

Input: $0.25 / 1M tokens

Output: $1.50 / 1M tokens
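
As a rough illustration of how the listed rates translate into a per-request cost, here is a minimal sketch. It assumes the prices are in US dollars per one million tokens and that input and output tokens are billed linearly at those rates; the function and constant names are hypothetical, not part of any provider SDK.

```python
# Rates copied from the Pricing section (assumed to be USD per 1M tokens).
INPUT_USD_PER_MILLION = 0.25
OUTPUT_USD_PER_MILLION = 1.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: assumes simple linear billing with no
    minimums, caching discounts, or per-modality surcharges.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 10,000-token reply.
    print(f"${estimate_cost_usd(200_000, 10_000):.4f}")
```

Under these assumptions, a 200,000-token prompt with a 10,000-token reply comes to about $0.065, with the prompt accounting for $0.05 of that.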

Specs

Context: 1,048,576 tokens

Input: text, image, video, file, audio

Output: text

Released: 2026-05

Supported parameters

include_reasoning, max_tokens, reasoning, response_format, seed, stop, structured_outputs, temperature, tool_choice, tools, top_p
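
One way to use this list is to check request options against it before sending a request. The sketch below is only an illustration: the parameter names come from the list above, while the helper name and the idea of validating on the client side are assumptions, and it says nothing about which values each parameter accepts.

```python
# Parameter names taken from the "Supported parameters" list for this model.
SUPPORTED_PARAMETERS = frozenset({
    "include_reasoning",
    "max_tokens",
    "reasoning",
    "response_format",
    "seed",
    "stop",
    "structured_outputs",
    "temperature",
    "tool_choice",
    "tools",
    "top_p",
})


def unsupported_parameters(request_options: dict) -> list[str]:
    """Return the option names that are not in the supported list.

    Hypothetical client-side check: it only compares key names and
    does not validate the values.
    """
    return sorted(
        name for name in request_options if name not in SUPPORTED_PARAMETERS
    )


if __name__ == "__main__":
    options = {"temperature": 0.2, "max_tokens": 512, "top_k": 40}
    print(unsupported_parameters(options))
```

Here `top_k` would be flagged because it does not appear in the list, while `temperature` and `max_tokens` pass.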