Inception: Mercury 2

inception/mercury-2

Tool useJSONReasoningStreaming

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

Pricing

Input

$0.25 / 1M

Output

$0.75 / 1M

Specs

Context

128,000 tokens

Input

text

Output

text

Released: 2026-03

Supported parameters

include_reasoningmax_tokensreasoningresponse_formatstopstructured_outputstemperaturetool_choicetools

Use this model →