Cirrascale Inference Platform Preview

Smarter Inferencing Starts Here

Introducing the world's first serverless inference-as-a-service platform for enterprise that intelligently selects the best accelerator for optimal performance and dynamically balances workloads across regions.

The Cirrascale Inference Platform is built from the ground up to go beyond existing inference solutions. Its unique capabilities analyze AI models, deploying them on the most ideal AI accelerators to balance performance and cost while maintaining enterprise-grade capabilities and security.

Inference for Enterprise at Scale

With numerous AI deployment options for inference, our core focus remains on delivering high uptime, resiliency, and scalability – even under the most demanding, high-volume inference scenarios. This is achieved through a serverless implementation that is effortless to deploy. Deployed pipelines benefit from dynamic regional balancing, ensuring the most consistent and performant experience possible. Large Language Models (LLMs), other Generative AI models, and multi-modal models are all supported, covering the full spectrum of enterprise workflow needs.

Automation to Right Size Your Needs

With the Cirrascale Inference Platform, the ideal AI accelerator is automatically chosen based on your specific requirements. Simply provide your AI model details, estimated scalability needs, and whether you need real-time or batch capabilities. The platform then assesses your real-world requirements and chooses the most appropriate AI acceleration technologies—without any user intervention. Peak-time usage (e.g. weekday business hours) can benefit from automatically scaled-up inference resources to handle higher token volumes, and scaling down once demand subsides. This provides more predictable billing than hyperscalers can provide while supporting higher token volumes.

Dynamic Regional Balancing

Cirrascale’s Inference Platform supports multiple regions for low latency connections to hyperscalers or on-premise infrastructure. Optional direct connect capabilities to leading hyperscaler regional zones enable even the most demanding, latency-sensitive, real-time applications—such as voice or multi-modal—to operate seamlessly. For larger data processing tasks, batch inference automatically leverages the most appropriate and available regional resources, with the option to cross over into other regions as needed for comprehensive batch inference requirements.

Discover the Benefits of the Cirrascale Inference Platform

Trusted High-Performance Cloud
  • Designed for performance, optimized for cost.
  • Scales to meet usage demands on the right AI accelerators.
  • Automatically balances workloads across regions.
Designed for Enterprise
  • Interconnects with existing infrastructure, whether at a hyperscaler or on-premise, for seamless workflow integration.
  • Utilize fine tuning and/or Retrieval Augmented Generation (RAG) to optimize model output for higher accuracy.
Configurable AI Pipelines
  • Supports open-source foundation models, distilled models, and proprietary models.
  • Offers real-time and batch inference capabilities.
Included Tools and Console
  • High performance with larger memory than other acceleration offerings.
  • Optimized for leading Generative AI models, including LLMs.

Learn More about the Cirrascale Inference Platform

Dive into the details of what makes the new Cirrascale Inference Platform revolutionary. Download the data sheet for a more in-depth look at the features and benefits that make up this enterprise inference platform.

Ready to Preview?

We're proud of the fact that we have worked with cloud pioneers from the very start. We were the trusted cloud backbone that helped OpenAI meet their cloud compute needs early on, and we continue to engage with today's bleeding edge AI companies, like yours.

Preview Acceptance

Due to the overwhelming response to preview the Cirrascale Inference Platform during its debut, access will be limited initially. Please be patient as we work our way through interested parties.

Questions Regarding Preview

Please send an email directly to preview@cirrascale.com with any specific questions regarding the early access to the platform. We will contact you as quickly as possible.

Request Preview Access

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Ready To Get Started?

Ready to take advantage of our flat-rate monthly billing, no ingress/egress data fees, and fast multi-tiered storage?

Get Started