← All models
X
Xiaomi: MiMo-V2.5
xiaomi/mimo-v2.5
VisionTool useJSONReasoningStreaming
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...
Pricing
Input
$0.14 / 1M
Output
$0.28 / 1M
Specs
Context
1,048,576 tokens
Input
text, audio, image, video
Output
text
Released: 2026-04
Supported parameters
frequency_penaltyinclude_reasoningmax_tokenspresence_penaltyreasoningresponse_formatstoptemperaturetool_choicetoolstop_p