Back to all models
nvidia
NVIDIA: Llama 3.1 Nemotron 70B Instruct
NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...
nvidia/llama-3.1-nemotron-70b-instruct
Context Size
131.072K
Input Price
7,380 Ks/M
Output Price
7,380 Ks/M
Architecture
Text
Supported Parameters
frequency_penaltymax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstoptemperaturetool_choicetoolstop_ktop_p
Details
TokenizerLlama3
Instruct Typellama3
Max Completion16,384 tokens
Provider Context131.072K tokens
ModeratedNo