← All models

Qwen: Qwen3.5-Flash

qwen/qwen3.5-flash-02-23

VisionTool useJSONReasoningStreaming

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Pricing

Input

$0.065 / 1M

Output

$0.26 / 1M

Specs

Context

1,000,000 tokens

Input

text, image, video

Output

text

Released: 2026-02

Supported parameters

include_reasoningmax_tokenspresence_penaltyreasoningresponse_formatseedstructured_outputstemperaturetool_choicetoolstop_p