← All models
Z
Z.ai: GLM 5V Turbo
z-ai/glm-5v-turbo
VisionTool useJSONReasoningStreaming
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Pricing
Input
$1.20 / 1M
Output
$4.00 / 1M
Specs
Context
202,752 tokens
Input
image, text, video
Output
text
Released: 2026-04
Supported parameters
include_reasoningmax_tokensreasoningresponse_formattemperaturetool_choicetoolstop_p