NVIDIA HGX B200

The New Era of Accelerated Computing is Here

Reserve Your NVIDIA HGX B200 Instances Now

The NVIDIA HGX™ B200 propels the data center into a new era of accelerated computing and generative AI, integrating NVIDIA Blackwell GPUs with a high-speed interconnect to accelerate AI performance at scale.

The NVIDIA HGX B200 will be available soon in the Cirrascale AI Innovation Cloud. Experience the highest performance in generative AI and HPC applications.

Reserve Now

NVIDIA Partner Network Cloud Provider

Cirrascale Offers the HGX B200 for Next-Level Performance

Cirrascale offers the HGX B200 in its AI Innovation Cloud as an 8-GPU configuration, giving you full GPU-to-GPU bandwidth through the NVIDIA NVLink™ Switch. As a premier accelerated scale-up x86 platform with up to 15X faster real-time inference, 12X lower cost, and 12X less energy use than the previous Hopper generation, HGX B200 is designed for the most demanding AI, data analytics, and high-performance computing (HPC) workloads.
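
Once you're on an instance, a quick check can confirm that topology. The snippet below is an illustrative sketch using standard PyTorch calls, not a Cirrascale-specific tool, to verify that all eight GPUs are visible and can reach one another directly over the NVLink fabric:

```python
# Illustrative sketch: confirm all 8 GPUs are visible and peer-accessible.
# Assumes a CUDA-enabled PyTorch install on the instance.
import torch

num_gpus = torch.cuda.device_count()
print(f"Visible GPUs: {num_gpus}")

for i in range(num_gpus):
    print(f"  GPU {i}: {torch.cuda.get_device_name(i)}")

# With an NVLink Switch topology, every GPU pair should report peer access.
for i in range(num_gpus):
    for j in range(num_gpus):
        if i != j and not torch.cuda.can_device_access_peer(i, j):
            print(f"  WARNING: GPU {i} cannot access GPU {j} directly")
```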

The NVIDIA Blackwell architecture introduces groundbreaking advancements for generative AI and accelerated computing. The incorporation of the second-generation Transformer Engine, alongside the faster and wider NVIDIA NVLink interconnect, propels the data center into a new era, with orders of magnitude more performance compared to the previous architecture generation.

Real-Time Inference for the Next Generation of Large Language Models

HGX B200 achieves up to 15X higher inference performance over the previous NVIDIA Hopper™ generation for massive models such as GPT MoE 1.8T.

The second-generation Transformer Engine uses custom Blackwell Tensor Core technology combined with TensorRT™-LLM and NVIDIA NeMo™ framework innovations to accelerate inference for LLMs and mixture-of-experts (MoE) models.
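
To give a sense of what inference on this stack looks like, here is a minimal sketch using TensorRT-LLM's high-level Python API. The model name and sampling settings are placeholders for illustration, not a recommended configuration:

```python
# Minimal sketch of LLM inference with TensorRT-LLM's high-level Python API.
# Model name and sampling settings are placeholders; on HGX B200 the
# second-generation Transformer Engine handles low-precision math under the hood.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="meta-llama/Llama-3.1-8B-Instruct")  # example model, swap in your own
params = SamplingParams(max_tokens=64, temperature=0.7)

outputs = llm.generate(["What is mixture-of-experts routing?"], params)
for out in outputs:
    print(out.outputs[0].text)
```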

Next-Level Training Performance

The second-generation Transformer Engine, featuring FP8 and new precisions, enables a remarkable 3X faster training for large language models like GPT MoE 1.8T.
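
In frameworks, FP8 compute is typically reached through NVIDIA's open-source Transformer Engine library for PyTorch. The sketch below shows a single FP8 training step with illustrative shapes and hyperparameters; it is a minimal example, not an HGX B200-tuned recipe:

```python
# Sketch of one FP8 training step with NVIDIA's Transformer Engine library.
# Layer sizes, batch size, and learning rate are illustrative only.
import torch
import transformer_engine.pytorch as te

model = te.Linear(4096, 4096, bias=True).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

x = torch.randn(32, 4096, device="cuda")
target = torch.randn(32, 4096, device="cuda")

# fp8_autocast runs supported layers in FP8 on capable GPUs,
# using Transformer Engine's default scaling recipe.
with te.fp8_autocast(enabled=True):
    out = model(x)

loss = torch.nn.functional.mse_loss(out, target)
loss.backward()
optimizer.step()
```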

This breakthrough is complemented by fifth-generation NVLink with 1.8 TB/s of GPU-to-GPU interconnect, the NVLink Switch chip, NVIDIA Quantum InfiniBand networking, and NVIDIA Magnum IO software. Together, these ensure efficient scalability for enterprises and extensive GPU computing clusters.

Why Cirrascale?

We're proud to have worked with cloud pioneers from the very start. We were the trusted cloud backbone that helped OpenAI meet its cloud compute needs early on, and we continue to engage with today's bleeding-edge AI companies, like yours.

Access to the Latest AI Accelerators

The Cirrascale AI Innovation Cloud contains today's latest accelerators, including NVIDIA HGX™ B200 and H200 GPUs, all interconnected with NVIDIA Quantum InfiniBand networking.

Specialized Cloud and Managed Services

Work with us to tailor the right solution for you with our wide range of system configurations, optimized for your specific workload requirements.

Transparent, Budget-Friendly Pricing

With no-surprises billing, long-term discounts, and no data transfer fees, Cirrascale offers unmatched pricing that's built around your needs.

Reserve Your HGX B200 Instances


Ready To Get Started?

Ready to take advantage of our flat-rate monthly billing, no ingress/egress data fees, and fast multi-tiered storage?

Get Started