Cirrascale Cloud Services

AMD Products

AMD Instinct MI350 Series

AMD Instinct™ MI350 Series GPUs are built on cutting-edge fourth-generation AMD CDNA™ architecture, setting new standards for GenAI and HPC in the cloud. With up to 288GB of HBM3E memory the MI350 delivers exceptional performance for training massive AI models, high-speed inference, and complex scientific workloads. Expanded datatype support, including FP16, FP8,and next-gen FP6 and FP4, maximizes throughput and energy efficiency for advanced AI models.

Cirrascale has announced the upcoming availability of the MI350 Series GPUs in its AI Innovation Cloud. Be sure to sign up to preview these instances when available.

Read the News >‍

AMD Instinct MI300X

AMD Instinct™ MI300X accelerators are uniquely well-suited to power even the most demanding AI and HPC workloads, offering exceptional compute performance, large memory density, high bandwidth memory, and support for specialized data formats.

AMD Instinct MI300X accelerators are built on AMD CDNA™ 3 architecture, which offers Matrix Core Technologies and support for a broad range of precision capabilities—from the highly efficient INT8 and FP8 (including sparsity support for AI) to the most demanding FP64 for HPC.

AMD Instinct MI325X

The AMD Instinct™ MI325X GPU accelerators set a new standard in performance for Gen AI models and data centers. Built on the 3rd Gen AMD CDNA™ architecture, these accelerators are designed for exceptional performance and efficiency for demanding AI tasks like training expansive models and inferencing.

Each accelerator is equipped with industry-leading 256 GB Next Gen HBM3e memory capacity and 6 TB/s bandwidth, combined with the processing power and datatype support required for AI deployments, the MI325X accelerators deliver the compute performance needed for any AI solution.

Learn More >‍

AMD Instinct MI250

The AMD Instinct MI250 accelerator brings customers the compute engine selected for the first U.S. Exascale supercomputer.

AMD Instinct MI250 accelerators are built on AMD CDNA™ architecture, which offers Matrix Core Technologies and support for a broad range of precision capabilities—from the highly efficient INT8 and FP8 to the most demanding FP64 for HPC.

Learn More >

Pricing

AMD Instinct Series Instance Pricing

8X AMD Instinct MI325X

Dual 48-Core

2.3TB

(1) 960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

8X AMD Instinct MI300X

Dual 48-Core

2.3TB

(1) 960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

$22,499

$20,249

$17,999

4X AMD Instinct MI250

Dual 64-Core

1TB

(1) 960 NVMe
(1) 3.84TB NVMe

25Gb Bonded

$4,679

$4,211

$3,743

8X AMD Instinct MI300X

4X AMD Instinct MI250

Dual 48-Core

Dual 64-Core

2.3TB

1TB

(1) 960 NVMe
(4) 3.84TB NVMe

(1) 960 NVMe
(1) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

25Gb Bonded

$22,499

$4,679

$20,249

$4,211

$17,999

$3,743

All pricing above is based on Cirrascale's No Surprises billing model. There are no hidden fees and discounts may apply for long-term commitments depending on the service requested. All pricing shown for servers are per server per month.

Pricing

NVIDIA GPU Cloud

8-GPU
NVIDIA B300

Dual 64-Core

3TB

960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

8-GPU
NVIDIA B200

Dual 48-Core

2TB

960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

$34,999

$31,499

$27,999

8-GPU
NVIDIA H200

Dual 48-Core

2TB

960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

$26,499

$23,849

$21,199

8-GPU
NVIDIA H100

Dual 48-Core

2TB

(1) 960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

$24,999

$22,499

$19,999

Cirrascale Cloud Services has one of the largest selections of NVIDIA GPUs available in the cloud.
The above represents our most popular instances, but check out our pricing page for more instance types.
Not seeing what you need? Contact us for a specialized cloud quote for the configuration you need.

8-GPU NVIDIA B300

8-GPU NVIDIA B200

8-GPU NVIDIA H200

8-GPU NVIDIA H100

Dual 64-Core

Dual 48-Core

3TB

2TB

(1) 960 NVMe
(4) 3.84TB NVMe

25Gb Bonded
_{(3200Gb Available)}

$34,999

$26,499

$24,999

$31,499

$23,849

$22,499

$27,999

$21,199

$19,999

Pricing

Qualcomm Cloud AI 100 Series Bare-Metal Pricing

8X AI 100 Ultra

128

512GB

(2) 3.84TB NVMe

$4,699

$3,759

Octo AI 100 Pro

384GB

1TB NVMe

$2,499

$2,019

Quad AI 100 Pro

182GB

1TB NVMe

$1,259

$1,009

Dual AI 100 Pro

48GB

1TB NVMe

$629

$519

Single AI 100 Pro (128)

128GB

1TB NVMe

$549

$439

Single AI 100 Pro (64)

64GB

1TB NVMe

$369

$289

Single AI 100 Pro (48)

48GB

1TB NVMe

$329

$259

8X AI 100 Ultra

Octo AI 100 Pro

Quad AI 100 Pro

Dual AI 100 Pro

Single AI 100 Pro (128)

Single AI 100 Pro (64)

Single AI 100 Pro (48)

128

512GB

384GB

182GB

48GB

64GB

48GB

(2) 3.84TB NVMe

1TB NVMe

$4,699

$2,499

$1,259

$629

$549

$369

$329

$3,759

$2,019

$1,009

$519

$439

$289

$259

Pricing

The Cerebras AI Model Studio

Fine-Tuning - Standard Offering Pricing

Eleuther GPT-J

$0.00055

$0.0011

$0.0023

132

Eleuther GPT-NeoX

$0.00190

$0.0039

$0.0078

451

CodeGen* 350M

0.35

$0.00003

$0.00006

$0.00013

CodeGen* 2.7B

2.7

$0.00026

$0.0005

$0.0027

CodeGen* 6.1B

6.1

$0.00065

$0.0013

$0.0030

154

CodeGen* 16.1B

16.1

$0.00147

$0.0030

$0.011

350

Eleuther GPT-J

Eleuther GPT-NeoX

CodeGen* 350M

CodeGen* 2.7B

CodeGen* 6.1B

CodeGen* 16.1B

0.35

2.7

6.1

16.1

$0.00055

$0.00190

$0.00003

$0.00026

$0.00065

$0.00147

$0.0011

$0.0039

$0.00006

$0.0005

$0.0013

$0.0030

$0.0023

$0.0078

$0.00013

$0.0027

$0.0030

$0.011

132

451

154

350

* T5 tokens to train from the original T5 paper. Chinchilla scaling laws not applicable.
‍
** Note that GPT-J was pre-trained on ~400B tokens. Fine-tuning jobs can employ a wide range of dataset sizes, but often use order 1-10% of the pre-training tokens. As such, one might fine-tune a model like GPT-J with ~4-40B tokens. We provide estimated wall clock time to fine-tune train the model checkpoints above with 10B tokens on Cerebras AI Model Studio and an AWS p4d instance in the table above to give you a sense of how much time jobs of this scale could take.

Fixed-Price Production Model Training

GPT3-XL

1.3

0.4

$2,500

GPT-J

120

$45,000

GPT-3 6.7B

6.7

134

$40,000

T-5 11B

34*

$60,000

GPT-3 13B

260

$150,000

GPT NeoX

400

$525,000

GPT 70B

1,400

Contact For Quote

GPT 175B

175

3,500

Contact For Quote

GPT3-XL

GPT-J

GPT-3 6.7B

T-5 11B

GPT-3 13B

GPT NeoX

GPT 70B

GPT 175B

1.3

6.7

175

120

134

34*

260

400

1,400

3,500

0.4

Contact For Quote

$2,500

$45,000

$40,000

$60,000

$150,000

$525,000

Contact For Quote

* T5 tokens to train from the original T5 paper. Chinchilla scaling laws not applicable.
‍
** Expected number of days, based on training experience to date, using a 4-node Cerebras Wafer-Scale Cluster. Actual training of model may take more or less time.

AMD Instinct Series Cloud

AMD Instinct™ Series

AMD Benefits

AMD ROCm Software

AMD Infinity Fabric Technology

Discover the Benefits of AMD Instinct hosted by Cirracale

Optimal Performance at the Right Price

Ease of Use

Ease of Use

Simple & Secure Cloud Operations

AMD Products

AMD Instinct MI350 Series

AMD Instinct MI300X

AMD Instinct MI325X

AMD Instinct MI250

AMD Instinct Series Instance Pricing

NVIDIA GPU Cloud

Qualcomm Cloud AI 100 Series Bare-Metal Pricing

The Cerebras AI Model Studio

Fine-Tuning - Standard Offering Pricing

Fixed-Price Production Model Training

Ready To Get Started?