Command Palette

Search for a command to run...

Back to all models
nvidia

NVIDIA: Nemotron 3 Ultra

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

nvidia/nemotron-3-ultra-550b-a55b

Context Size

1,000K

Input Price

75,000 pts/M

Output Price

375,000 pts/M


Architecture

Text

Supported Parameters

frequency_penaltyinclude_reasoninglogit_biasmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p

Details

TokenizerOther
Max Completion16,384 tokens
Provider Context262.144K tokens
ModeratedNo
CreatedJun 4, 2026