Z.ai: GLM 4.5V

z-ai/glm-4.5v

图像理解工具调用JSON推理流式

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

价格

输入

$0.6 / 1M

输出

$1.80 / 1M

参数

上下文

65,536 tokens

输入模态

text, image

输出模态

text

知识截止：2024-12-31

发布：2025-08

支持参数

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningrepetition_penaltyresponse_formatseedstoptemperaturetool_choicetoolstop_ktop_p

开放权重 · HuggingFace

178,678 月下载

718 收藏

mit image-text-to-text

arXiv:2507.01006

在 HuggingFace 查看 →

使用该模型 →