Qwen: Qwen3.5-Flash
qwen/qwen3.5-flash-02-23
Vision, Tool use, JSON, Reasoning, Streaming
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...
Pricing
Input
$0.065 / 1M tokens
Output
$0.26 / 1M tokens
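As a rough illustration of how these rates translate into a per-request cost, the sketch below multiplies token counts by the listed prices. It assumes the rates are per one million tokens and that every input, including image and video content, is billed at the flat input rate; the page does not confirm either point. The function name `estimate_cost_usd` is hypothetical.

```python
# Listed rates, assumed to be USD per one million tokens.
INPUT_USD_PER_MILLION = 0.065
OUTPUT_USD_PER_MILLION = 0.26


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD from its token counts.

    Assumption: all tokens are billed at the flat listed rates, with no
    separate pricing for images, video, caching, or reasoning tokens.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 1,500-token reply.
    print(f"{estimate_cost_usd(20_000, 1_500):.6f} USD")
```

Output tokens cost four times as much as input tokens at these rates, so long generations tend to dominate the bill even when the prompt is large.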
Specs
Context
1,000,000 tokens
Input
text, image, video
Output
text
Released: 2026-02
Supported parameters
include_reasoning, max_tokens, presence_penalty, reasoning, response_format, seed, structured_outputs, temperature, tool_choice, tools, top_p
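One way to use this list is to check a request's optional parameters before sending it. The sketch below is a minimal, client-side filter that makes no network call. The helper name `split_parameters` is hypothetical, and required request fields (such as the model ID and the messages) are assumed to be handled elsewhere.

```python
from typing import Any, Dict, List, Tuple

# Optional parameters listed as supported for this model.
SUPPORTED_PARAMETERS = frozenset({
    "include_reasoning",
    "max_tokens",
    "presence_penalty",
    "reasoning",
    "response_format",
    "seed",
    "structured_outputs",
    "temperature",
    "tool_choice",
    "tools",
    "top_p",
})


def split_parameters(params: Dict[str, Any]) -> Tuple[Dict[str, Any], List[str]]:
    """Split params into (supported subset, sorted unsupported names)."""
    supported = {k: v for k, v in params.items() if k in SUPPORTED_PARAMETERS}
    unsupported = sorted(k for k in params if k not in SUPPORTED_PARAMETERS)
    return supported, unsupported


if __name__ == "__main__":
    requested = {"temperature": 0.2, "max_tokens": 512, "top_k": 40}
    accepted, rejected = split_parameters(requested)
    print("accepted:", accepted)
    print("not listed as supported:", rejected)
```

Whether an unlisted parameter is rejected or silently ignored by the serving API is not stated on this page, so the sketch only reports such names and leaves the decision to the caller.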