← 模型广场

DeepSeek: R1 Distill Qwen 32B

deepseek/deepseek-r1-distill-qwen-32b

JSON推理流式

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

价格

输入

$0.29 / 1M

输出

$0.29 / 1M

参数

上下文

128,000 tokens

输入模态

text

输出模态

text

知识截止:2024-07-31

发布:2025-01

支持参数

frequency_penaltyinclude_reasoninglogprobsmax_tokenspresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetop_logprobstop_p

开放权重 · HuggingFace

601,125 月下载
1,564 收藏
mit text-generation
在 HuggingFace 查看 →