On-Demand and Reserved

DeployInstantly

From zero to training in minutes—launch high-performance clusters without the usual overhead.

Hero image

Why Hyperbolic On-Demand

Affordable compute

Rent GPUs starting at $0.20/GPU/hr, cutting compute costs for training and inference.

Right GPU for Right Workloads

Choose from H100 SXM, RTX 3070, NVIDIA H200, RTX 4090, RTX 3080 — optimized for AI/ML workloads.

Flexible payments

Pay with wire / ACH upfront or monthly, or pay as you go via credit card / stripe

Secure SSH access

Authenticate via SSH key pairs for secure remote access (public key uploaded, private key stays local).

Smart billing notifications

Get notified within 3 minutes if an instance fails. No charges for failed instances — only pay for GPUs that come online.

Agent-compatible API

Automate GPU provisioning by allowing your AI agents or scripts to spin up and manage instances via API.

Pre-built Docker images

Skip setup and launch GPU workloads instantly with ready-to-use images for PyTorch, TensorFlow, and CUDA.

Clustered GPU allocation

Rent multiple GPUs in a cluster to unlock additional savings and maximized efficiency.

Hero image

More Flexibility,

Less Overhead

No vendor lock-in, instant availability,
and up to 75% cost savings.

AWS

Azure

CoreWeave

Fluidstack

Lambda Labs

RunPod

How it Works

A simple guide for how to get your AI workloads running on Hyperbolic
GPUs at affordable rates.

Choose your setup: fast VMs or bare metal performance

Set your GPU count: scale from a single node to 1000+ GPUs

Pick your interconnect: InfiniBand or Ethernet

Launch a cluster in minutes with no provisioning delays

Nvidia H100 SXM

$1.50 / HR

Nvidia H200

$2.40 / HR

Nvidia B200

$3.50 / HR

Nvidia RTX 4090

$0.30 / HR

Nvidia RTX 3070

$0.16 / HR

Start TrainingStart Training

Note: Pricing is refreshed weekly based on the best available rates from suppliers on our platform.

Guaranteed Capacity for Long Term Training

Lock in guaranteed GPU capacity for long-running training, fine-tuning, and scaling—without job interruptions or preemption.

Built for Every Workload

  • Evaluating Open Models at Scale

  • Generative AI development

Hyperbolic's computing platform has provided robust and reliable support for our Chatbot Arena. We run our FastChat and SLang applications on this platform to serve state-of-the-art open vision-language models. We are thrilled to leverage their solutions to deliver exceptional user experiences.

Lianmin Zheng

Lianmin Zheng

Member of Technical Staff, xAI