GPU / HGX B200

NVIDIA HGX B200

NVIDIA Blackwell platform has arrived on Together AI

Why HGX B200 on Together GPU Clusters?

The world’s most powerful AI infrastructure. Delivered faster. Tuned smarter.

Train faster on 8-GPU HGX nodes

Each system includes 8 Blackwell GPUs with 2nd-gen Transformer Engine and FP8 precision, optimized for maximum throughput.

Optimized NVLink topologies

Configured 1.8TB/s NVSwitch fabrics per node, extending to spine-leaf or Clos topologies for dense LLMs or sparsely activated MoE workloads.

Delivery in 4–6 weeks, no NVIDIA lottery required

Full-rack clusters ship with thousands of GPUs available; no backorder delays.

Run by researchers who train models

Team actively tunes training workloads on NVIDIA GB200 systems.

What our customers are saying

View All Stories

"Delivering competitive pricing, strong reliability and a properly set up cluster is the bulk of the value differentiation for most AI clouds. The only differentiated value we have seen outside this set is from a Neocloud called Together AI, where the inventor of FlashAttention, Tri Dao, works. We don't believe the value created by Together can be replicated elsewhere."

Dylan Patel

Founder, SemiAnalysis

"Training our omnimodal Character-3 model required infrastructure designed for large-scale AI. The Together Frontier AI Factory delivered the performance we needed to push the boundaries of multimodal video generation. Together AI understands what builders need — and that made all the difference."

Michael Lingelbach

CEO, Hedra

"Together GPU Clusters provided a combination of amazing training performance, expert support, and the ability to scale to meet our rapid growth to help us serve our growing community of AI creators."

Demi Guo

CEO, Pika

“Together AI provides the performance and reliability we need for real-time, high-quality image and video generation at scale. We value that Together AI is much more than an infrastructure provider — they're a true innovation partner, enabling us to push creative boundaries without compromise.”

Victor Perez

Co-Founder, Krea

View All Stories

Outstanding specs of HGX B200

Performance

Faster inference

vs H100

Faster inference

15x

vs H100

Better efficiency

12x

vs H100

Salesforce AI Research

"We’ve been thoroughly impressed with the Together Enterprise Platform. It has delivered a 2x reduction in latency (time to first token) and cut our costs by approximately a third. These improvements allow us to launch AI-powered features and deliver lightning-fast experiences faster than ever before."

Caiming Xiong

VP Salesforce AI Research

Technical specification

Blackwell GPUs 8 GPUs
Total FP4 Tensor Core 144 PFLOPS
Total FP8/FP6 Tensor Core 72 PFLOPS
Total Fast Memory Up to 1.4 TB
Total Memory Bandwidth Up to 62 TB/s
Total NVLink Bandwidth 14.4 TB/s
FP4 Tensor Core (per GPU) 18 PFLOPS
FP8/FP6 Tensor Core (per GPU) 9 PFLOPS
INT8 Tensor Core 9 POPS
FP16/BF16 Tensor Core 4.5 PFLOPS
TF32 Tensor Core 2.2 PFLOPS
FP32 75 TFLOPS
FP64 / FP64 Tensor Core 37 TFLOPS
Multi-Instance GPU (MIG) 7
Decompression Engine Yes
Decoders 7 NVDEC, 7 nvJPEG
Max Thermal Design Power (TDP) Configurable up to 1,000 W
Interconnect 5th Gen NVLink: 1.8 TB/s, PCIe Gen5: 128 GB/s

Infrastructure you can trust at scale.
Production-grade security.

We take security and compliance seriously, with strict data privacy controls to keep your information protected. Your data and models remain fully under your ownership, safeguarded by robust security measures.

Learn More

As an NVIDIA Cloud Partner, Together builds and operates clusters on NVIDIA NCP reference architectures for predictable performance and faster time to production. Your data and models remain under your control with strict privacy safeguards and SOC 2–compliant security practices.

NVIDIA preferred partner
AICPA SOC 2 Type II

Regions and availability zones

Choose from global regions to meet data residency and compliance requirements—HIPAA for healthcare, GDPR for Europe, or banking regulations.

USA
2GW+ in the portfolio with 600MW of near-term capacity in US.
Europe
150 MW+ available in Europe: UK, Spain, France, Portugal, and Iceland also.
Asia & Middle East
Options available based on the scale of the projects in Asia and the Middle East.