← All models View on HuggingFace →
Qwen: Qwen2.5 VL 72B Instruct
qwen/qwen2.5-vl-72b-instruct
VisionJSONStreaming
Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.
Pricing
Input
$0.25 / 1M
Output
$0.75 / 1M
Specs
Context
131,072 tokens
Input
text, image
Output
text
Knowledge cutoff: 2024-06-30
Released: 2025-02
Supported parameters
frequency_penaltylogit_biasmax_tokenspresence_penaltyrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetop_ktop_p
Open weights · HuggingFace
381,678 downloads/mo
623 likes
other image-text-to-text