Z.ai: GLM 5V Turbo

z-ai/glm-5v-turbo

VisionTool useJSONReasoningStreaming

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

Pricing

Input

$1.20 / 1M

Output

$4.00 / 1M

Specs

Context

202,752 tokens

Input

image, text, video

Output

text

Released: 2026-04

Supported parameters

include_reasoningmax_tokensreasoningresponse_formattemperaturetool_choicetoolstop_p

Use this model →