← All models
I
Inception: Mercury 2
inception/mercury-2
Tool useJSONReasoningStreaming
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...
Pricing
Input
$0.25 / 1M
Output
$0.75 / 1M
Specs
Context
128,000 tokens
Input
text
Output
text
Released: 2026-03
Supported parameters
include_reasoningmax_tokensreasoningresponse_formatstopstructured_outputstemperaturetool_choicetools