SargalaySargalay

Command Palette

Search for a command to run...

Back to all models
inception

Inception: Mercury Coder

Mercury Coder is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed optimized models like Claude 3.5 Haiku and GPT-4o Mini while matching their performance. Mercury Coder's speed means that developers can stay in the flow while coding, enjoying rapid chat-based iteration and responsive code completion suggestions. On Copilot Arena, Mercury Coder ranks 1st in speed and ties for 2nd in quality. Read more in the [blog post here](https://www.inceptionlabs.ai/blog/introducing-mercury).

inception/mercury-coder

Context Size

128K

Input Price

1,537.5 Ks/M

Output Price

4,612.5 Ks/M


Architecture

Text

Supported Parameters

max_tokensresponse_formatstopstructured_outputstemperaturetool_choicetools

Details

TokenizerOther
Max Completion32,000 tokens
Provider Context128K tokens
ModeratedNo