This website uses cookies to anonymously analyze website traffic using Google Analytics.

Preview

Together Instant GPU Clusters

Launch clusters of up to 64 GPUs, entirely self-service, in minutes

You're on the list - please make sure to have created a Together AI account so we can grant you access as soon as we're able.
Oops! Something went wrong while submitting the form.

Why Together Instant GPU Clusters?

  • Instant AI Compute

    No approvals, no wait times—deploy high-performance clusters in minutes.

  • Optimized for Distributed AI

    Together Instant GPU Clusters feature NVIDIA Quantum-2 InfiniBand and NVLink networking, ensuring ultra-low-latency, high-throughput performance for large-scale AI workloads.

  • Kubernetes or Slurm

    Customers can choose Kubernetes or Slurm for workload orchestration, giving them full control over their AI infrastructure.

  • No Limitations on Driver or CUDA Versions

    Users have full control over the software environment, ensuring compatibility with any required driver or CUDA version, without restrictions.

  • No Long-Term Commitments

    Unlike traditional cloud GPU offerings, Instant GPU Clusters enable teams to spin up and down compute resources for short-term projects, eliminating the need for lengthy, upfront commitments.

  • Enterprise-Grade Performance

    Each cluster is built using NVIDIA H100 (80GB SXM) GPUs, engineered for AI training, inference, and fine-tuning at scale.

How it Works

Bare-Metal Performance Built with the NVIDIA NCP reference architecture, Together Instant Clusters provide bare-metal performance for compute, network and storage resources, making them ideal for high performance multi-node for AI training and inference.

End-to-End Cluster Management
From creation to deployment, our software stack streamlines every step including acceptance testing, validation, and installation of Kubernetes or Slurm.

Ultra-Fast Provisioning Get your cluster in minutes with Together’s instant deployment, ensuring quick access to high-performance AI infrastructure.

Try the interactive calculator

  • HARDWARE TYPES

    pricING

  • HARDWARE TYPES

    NVIDIA GB200

    Pricing

    Coming soon

  • HARDWARE TYPES

    NVIDIA B200

    Pricing

    Coming soon

  • HARDWARE TYPES

    NVIDIA H200

    Pricing

    Coming soon

  • HARDWARE TYPES

    NVIDIA H100

    Pricing

    up to 16: $2.85 GPU/hr
    up to 64: $2.82 GPU/hr

  • STORAGE TYPES

    STORAGE SIZE

    pricING

  • STORAGE TYPES

    Shared Storage

    STORAGE SIZE

    up to 100Tb

    pricING

    $0.16 Gib/mo

 GPU Calculator

Storage Calculator