← All models

Google: Gemma 4 31B

google/gemma-4-31b-it

VisionTool useJSONReasoningStreaming

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Pricing

Input

$0.12 / 1M

Output

$0.37 / 1M

Specs

Context

262,144 tokens

Input

image, text, video

Output

text

Released: 2026-04

Supported parameters

frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p

Open weights · HuggingFace

11,305,133 downloads/mo
2,832 likes
apache-2.0 image-text-to-text
View on HuggingFace →