Models / Meta
Chat

NIM Llama 3.1 Nemotron 70B Instruct

NVIDIA NIM for GPU accelerated Llama 3.1 Nemotron 70B Instruct inference through OpenAI compatible APIs.

This model is not available on Together’s Serverless API.

Deploy this model on an on-demand Dedicated Endpoint or pick a supported alternative from the Model Library.

Related models
  • Model provider
    Meta
  • Type
    Chat
  • Parameters
    70B
  • Context length
    128K