Deploy Leading AI Models Accelerated by NVIDIA NIM on Together AI
Together AI is a leader at scaling the deployment of generative AI models with fast performance and industry-leading cost efficiency. Developers can explore and experience the performance and reliability of more than 160 leading AI models powered by NVIDIA NIM and starting today, quickly deploy select NIMs as dedicated endpoints on the Together platform.
NVIDIA NIM is a set of easy-to-use microservices designed for secure, reliable deployment of high-performance AI model inferencing across clouds, data centers, and workstations. NIM includes optimized inference engines, industry-standard APIs, and runtime dependencies for the latest AI models, all prepackaged in enterprise-grade software containers ready to deploy and scale anywhere.
Seamless Enterprise-Grade Deployments on Together AI
The NVIDIA AI Enterprise software platform, which includes NVIDIA NIM, delivers the reliability of continuous proactive security fixes with stable APIs and performance that enterprises depend on to operate at scale. You can now deploy these NIM microservices on Together AI for production workloads at scale. Together AI has tremendous capacity for the most demanding enterprise or rapidly growing applications and supports automatically scaling up or down dedicated infrastructure for your application. A dedicated endpoint for the NIM can be accessed through both the Together API and web playground.
Deploy NIM on Together AI with a few clicks
Together AI is one of the first NVIDIA NIM hosting partners integrated on the NVIDIA API Catalog. This integration creates a streamlined path from model exploration to production deployment on hosted infrastructure. Test and experiment with the latest models powered by NVIDIA NIM on build.nvidia.com, then deploy directly to Together Cloud with just a few clicks.
This direct integration eliminates many of the traditional hurdles in the model deployment pipeline, allowing developers who prefer a hosted endpoint solution to move from experimentation to production quickly and confidently.

Why deploy NVIDIA NIM on Together AI?
Together AI offers several key advantages for organizations looking to leverage NVIDIA NIM microservices as hosted endpoints:
Exceptional scale and capacity: With our infrastructure built specifically for AI workloads, Together AI provides the computing resources needed for even the most demanding applications. Our platform is designed to handle high-throughput requirements with consistent performance.
Developer-centric experience: Our platform is built by developers for developers. With over 450,000 developers already using Together AI, we've refined our offering to provide intuitive interfaces, comprehensive documentation, and responsive support.
Cost-effective resource management: Our auto-scaling and auto-shutdown capabilities ensure you only pay for the resources you actually use, optimizing both performance and cost.
Enterprise-ready performance: Major organizations including Salesforce, Zoom, Zomato, and The Washington Post trust Together AI to power their AI initiatives. Our platform delivers the reliability, security, and performance that enterprise applications demand.
Getting started
Select NIMs are available for easy deployment on Together AI:
- Visit build.nvidia.com to explore the available models powered by NIM.
- Select the models you want to deploy.
- Choose Together AI as your deployment platform (available for some models)
- Configure your deployment options.
- Launch your dedicated endpoint.
Once deployed, you can interact with your models through our comprehensive API or the intuitive web playground.
Looking forward
This collaboration between Together AI and NVIDIA represents an important step toward making powerful AI capabilities more accessible to developers and organizations of all sizes. By combining NVIDIA's state-of-the-art models with the Together AI scalable, developer-friendly platform, we're helping to accelerate the adoption of AI across industries.
We're committed to continuing this partnership and bringing even more innovations to our community in the months ahead.
To learn more about deploying NVIDIA NIM on Together AI, visit api.together.ai/models to browse the latest models powered by NIM available along with the over 200 other models Together AI supports, or stop by booth #1332 at GTC 2025 to speak with our team.
- Lower
Cost20% - faster
training4x - network
compression117x
Q: Should I use the RedPajama-V2 Dataset out of the box?
RedPajama-V2 is conceptualized as a pool of data that serves as a foundation for creating high quality datasets. The dataset is thus not intended to be used out of the box and, depending on the application, data should be filtered out using the quality signals that accompany the data. With this dataset, we take the view that the optimal filtering of data is dependent on the intended use. Our goal is to provide all the signals and tooling that enables this.
article