Back to all models
nvidia
NVIDIA: Nemotron 3 Ultra
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
nvidia/nemotron-3-ultra-550b-a55b
Context Size
1,000K
Input Price
75,000 pts/M
Output Price
375,000 pts/M
Architecture
Text
Supported Parameters
frequency_penaltyinclude_reasoninglogit_biasmax_tokensmin_ppresence_penaltyreasoningrepetition_penaltyresponse_formatseedstopstructured_outputstemperaturetool_choicetoolstop_ktop_p
Details
TokenizerOther
Max Completion16,384 tokens
Provider Context262.144K tokens
ModeratedNo
CreatedJun 4, 2026