nvidia

NVIDIA: Llama 3.1 Nemotron 70B Instruct

NVIDIA's Llama 3.1 Nemotron 70B is a language model designed for generating precise and useful responses. Leveraging [Llama 3.1 70B](/models/meta-llama/llama-3.1-70b-instruct) architecture and Reinforcement Learning from Human Feedback (RLHF), it excels...

nvidia/llama-3.1-nemotron-70b-instruct

Context Size

131.072K

Input Price

7,380 Ks/M

Output Price

7,380 Ks/M

Architecture

Text

Supported Parameters

frequency_penaltymax_tokensmin_ppresence_penaltyrepetition_penaltyresponse_formatseedstoptemperaturetool_choicetoolstop_ktop_p

Details

TokenizerLlama3

Instruct Typellama3

Max Completion16,384 tokens

Provider Context131.072K tokens

ModeratedNo

Command Palette

NVIDIA: Llama 3.1 Nemotron 70B Instruct

Architecture

Supported Parameters

Details