
Preview

Together Instant GPU Clusters

Launch clusters of up to 64 GPUs, entirely self-service, in minutes


Why Together Instant GPU Clusters?

  • Instant AI Compute

    No approvals, no wait times—deploy high-performance clusters in minutes.

  • Optimized for Distributed AI

    Together Instant GPU Clusters feature NVIDIA Quantum-2 InfiniBand and NVLink networking, ensuring ultra-low-latency, high-throughput performance for large-scale AI workloads.

  • Kubernetes or Slurm

    Customers can choose Kubernetes or Slurm for workload orchestration, giving them full control over their AI infrastructure.

  • No Limitations on Driver or CUDA Versions

    Users have full control over the software environment, ensuring compatibility with any required driver or CUDA version, without restrictions.

  • No Long-Term Commitments

    Unlike traditional cloud GPU offerings, Instant GPU Clusters enable teams to spin up and down compute resources for short-term projects, eliminating the need for lengthy, upfront commitments.

  • Enterprise-Grade Performance

    Each cluster is built using NVIDIA H100 (80GB SXM) GPUs, engineered for AI training, inference, and fine-tuning at scale.

How it Works

Bare-Metal Performance
Built with the NVIDIA NCP reference architecture, Together Instant Clusters provide bare-metal performance for compute, network, and storage resources, making them ideal for high-performance, multi-node AI training and inference.

End-to-End Cluster Management
From creation to deployment, our software stack streamlines every step including acceptance testing, validation, and installation of Kubernetes or Slurm.

Ultra-Fast Provisioning
Get your cluster in minutes with Together’s instant deployment, ensuring quick access to high-performance AI infrastructure.


    HARDWARE TYPE     PRICING
    NVIDIA GB200      Coming soon
    NVIDIA B200       Coming soon
    NVIDIA H200       Coming soon
    NVIDIA H100       Up to 16 GPUs: $2.85 per GPU/hr
                      Up to 64 GPUs: $2.82 per GPU/hr

    STORAGE TYPE      STORAGE SIZE      PRICING
    Shared Storage    Up to 100 TB      $0.16 per GiB/mo
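
To get a rough sense of what a cluster might cost, the listed rates can be plugged into a short calculation. The Python sketch below is only an illustration: the function names are hypothetical, and it assumes that the cluster's total GPU count selects a single per-GPU rate for the whole cluster and that storage is billed per GiB per month. Actual billing rules may differ.

```python
# Rates copied from the pricing listed above. How the tiers are applied
# (one rate for the whole cluster, chosen by its size) is an assumption.
H100_TIERS = [(16, 2.85), (64, 2.82)]  # (max GPUs in cluster, USD per GPU-hour)
STORAGE_USD_PER_GIB_MONTH = 0.16


def h100_rate(gpus: int) -> float:
    """Return the assumed per-GPU hourly rate for a cluster of `gpus` H100s."""
    if gpus < 1:
        raise ValueError("a cluster needs at least one GPU")
    for max_gpus, rate in H100_TIERS:
        if gpus <= max_gpus:
            return rate
    raise ValueError("the listed tiers stop at 64 GPUs")


def gpu_cost(gpus: int, hours: float) -> float:
    """Estimated GPU cost in USD: GPUs x hours x per-GPU hourly rate."""
    return round(gpus * hours * h100_rate(gpus), 2)


def storage_cost(gib: float, months: float = 1.0) -> float:
    """Estimated shared-storage cost in USD: GiB x months x monthly rate."""
    return round(gib * months * STORAGE_USD_PER_GIB_MONTH, 2)


if __name__ == "__main__":
    # Example: an 8-GPU cluster for one day, plus 1024 GiB stored for a month.
    print("GPU cost (USD):", gpu_cost(8, 24))
    print("Storage cost (USD):", storage_cost(1024))
```

Under these assumptions, 8 GPUs for 24 hours is 192 GPU-hours at $2.85, or about $547.20, and 1024 GiB of shared storage for one month is about $163.84.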


Forging the AI Frontier with NVIDIA Reference Architecture

As an NVIDIA Cloud Partner, we are at the forefront of deploying, optimizing, and operating NVIDIA GB200 NVL72 GPU clusters.


United States

AI Data Centers and Power across the US

Data Center
Portfolio

150MW+ available in Europe: UK, Spain, France, Portugal, and Iceland.

Europe

Expansion Capability in Europe and Beyond

Data Center
Portfolio

2GW+ in the Portfolio with 600MW of near-term Capacity.

Asia / Middle East

Next Frontiers – Asia and the Middle East

Data Center
Portfolio

Options available based on the scale of the projects in Asia and the Middle East.