← All models

Google: Gemma 4 31B (free)

google/gemma-4-31b-it:free

VisionTool useJSONReasoningStreaming

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Specs

Context

262,144 tokens

Input

image, text, video

Output

text

Released: 2026-04

Supported parameters

include_reasoningmax_tokensreasoningresponse_formatseedstoptemperaturetool_choicetoolstop_p

Open weights · HuggingFace

11,305,133 downloads/mo
2,832 likes
apache-2.0 image-text-to-text
View on HuggingFace →