Xiaomi: MiMo-V2.5

xiaomi/mimo-v2.5

VisionTool useJSONReasoningStreaming

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

Pricing

Input

$0.14 / 1M

Output

$0.28 / 1M

Specs

Context

1,048,576 tokens

Input

text, audio, image, video

Output

text

Released: 2026-04

Supported parameters

frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstoptemperaturetool_choicetoolstop_p

Open weights · HuggingFace

213,923 downloads/mo

275 likes

mit

View on HuggingFace →

Use this model →