← All models

Qwen: Qwen3.5 397B A17B

qwen/qwen3.5-397b-a17b

VisionTool useJSONReasoningStreaming

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...

Pricing

Input

$0.39 / 1M

Output

$2.34 / 1M

Specs

Context

262,144 tokens

Input

text, image, video

Output

text

Released: 2026-02

Supported parameters

frequency_penaltyinclude_reasoninglogit_biaslogprobsmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_logprobstop_p

Open weights · HuggingFace

1,112,620 downloads/mo
1,496 likes
apache-2.0 image-text-to-text
View on HuggingFace →