Back to all models
nvidia
NVIDIA: Nemotron 3 Ultra (free)
NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...
nvidia/nemotron-3-ultra-550b-a55b:free
Context Size
1,000K
Input Price
Free
Output Price
Free
Architecture
Text
Supported Parameters
include_reasoningmax_tokensreasoningseedtemperaturetool_choicetoolstop_p
Details
TokenizerOther
Max Completion65,536 tokens
Provider Context1,000K tokens
ModeratedNo
CreatedJun 4, 2026