← All models

Qwen: Qwen2.5 VL 72B Instruct

qwen/qwen2.5-vl-72b-instruct

VisionJSONStreaming

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

Pricing

Input

$0.25 / 1M

Output

$0.75 / 1M

Specs

Context

131,072 tokens

Input

text, image

Output

text

Knowledge cutoff: 2024-06-30

Released: 2025-02

Supported parameters

frequency_penaltylogit_biasmax_tokenspresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetop_ktop_p

Open weights · HuggingFace

381,678 downloads/mo
623 likes
other image-text-to-text
View on HuggingFace →