NIM Llama 3.1 Nemotron 70B Instruct
NVIDIA NIM for GPU accelerated Llama 3.1 Nemotron 70B Instruct inference through OpenAI compatible APIs.
This model is not available on Together’s Serverless API.
Deploy this model on an on-demand Dedicated Endpoint or pick a supported alternative from the Model Library.
Related models
- TypeChat
- Parameters70B
- Context length128K
- External link
- CategoryChat