Platform Comparison

Quick Comparison Table

Feature	On-Demand GPUs	Reserved	Private Cloud
Best for	Training, fine-tuning, experiments, burst compute	Sustained 24/7 workloads, predictable capacity	Production-scale, lowest pricing, long-term
Setup time	< 5 minutes	< 5 minutes	Custom (days); capacity confirmed in 24h
Minimum commitment	None (hourly)	1 month (Self-serve terms available from 1 week to 1 month)	Long-term (multi-month to multi-year)
Pricing model	$/GPU/hour	Discounted prepaid $/GPU/hour	Custom contract
GPU access	Full SSH / root (VM or bare metal)	Full SSH / root (VM or bare metal)	Full root, single-tenant; optional managed k8s / Slurm
Scaling	Manual; self-serve multi-node	Self-serve; reserved up front	Pre-provisioned, custom topology
Networking	Ethernet or InfiniBand (multi-node)	Ethernet or InfiniBand (multi-node)	InfiniBand, private subnets, network isolation
Support level	Standard (sub-24h; P0 < 1h)	Standard	Dedicated, direct-to-engineer (24×7)
SLA	99.5% uptime	99.5% uptime	Custom, negotiated per contract

GPU rates vary by region and availability. Always check app.hyperbolic.ai/gpus for current rates and real-time availability.

Detailed Comparison

On-Demand GPUs

Self-serve, pay-as-you-go GPU instances drawn from an aggregated supplier network. Spin up a single GPU or an interconnected multi-node cluster in minutes — no contracts, no upfront commitment, no sales calls.When to use:

Training and fine-tuning custom models
Running experiments, notebooks, and evaluations
Burst and batch processing jobs
Short training runs where you need full control of the environment

Key features:

Full root access via SSH
Virtual Machine or Bare Metal configurations
Deploy interconnected H100, H200, or B200 clusters (8, 16, 32, 64, 128+ GPUs) instantly
Up to 24 TB local NVMe storage; attachable network storage (~$0.0766/TB/month)
Hourly billing with no hidden or egress fees — failed instances are never charged

Reserved

The same GPU capacity you would use on-demand, reserved for a fixed term at a discounted rate. Same platform, same hardware, same setup — you simply commit up front for a lower price.When to use:

24/7 production inference and LLM tooling
Sustained or scheduled training runs
High-volume, predictable usage
Teams that want a discount on capacity they already run on-demand

Key features:

Same on-demand GPUs, networking, and environment — reserved for your term
Self-serve in the app; larger commitments can also be arranged with our team
Discounted rate in exchange for committing to a term
Reserved instances run for the full term and are not terminated early

Pricing structure:

Discounted prepaid $/GPU/hour — longer terms unlock lower rates
Paid up front for the reservation period
See Pricing for details

Private Cloud

Dedicated, single-tenant GPU infrastructure for production-scale AI workloads. Get an isolated environment with custom networking, storage, and SLAs — plus the best unit economics on long-term commitments.When to use:

Enterprise production workloads at sustained scale
Security-conscious teams that require single-tenant isolation
Large, long-term commitments (typically $1M+ deployments)
Workloads that need custom SLAs, custom networking, or custom storage

Key features:

Single-tenant, customer-isolated GPU clusters (H100, H200, B200)
Network isolation with private subnets and firewall rules
High-bandwidth InfiniBand interconnect for distributed training
Choice of operating model: full platform visibility, managed Kubernetes, managed Slurm, or a fully isolated environment where only your team holds credentials
Custom storage architecture (node-local NVMe and shared/parallel filesystems)
Dedicated, direct-to-engineer support with 24×7 coverage for critical issues

Pricing structure:

Custom contract based on cluster size, GPU type, term length, and configuration
Long-term commitments with the best unit economics
Talk to sales for a tailored quote

Available GPUs

Hyperbolic offers H100, H200, and B200 capacity across On-Demand GPUs, Reserved, and Private Cloud. Availability may vary by product, region, supply, and configuration. If you need a specific GPU type, cluster configuration, or capacity profile, contact sales@hyperbolic.ai and our team can help evaluate what is available or what can be sourced for your workload.

GPU	Architecture	Memory	Memory Bandwidth	NVLink	TDP	Best for
B200	Blackwell	192 GB HBM3e	8 TB/s	1.8 TB/s	~1000 W	Frontier-scale training and highest-throughput inference
H200	Hopper	141 GB HBM3e	4.8 TB/s	900 GB/s	700 W	Large-model training and memory-bound inference
H100	Hopper	80 GB HBM3	3.35 TB/s	900 GB/s	700 W	Mainstream training, fine-tuning, and inference

Node configuration (H100 / H200): 8 GPUs per node with up to 160 vCPUs and up to 1.5 TB RAM per node, and up to 24 TB of local NVMe storage. Available as Virtual Machines (fast, flexible, ideal for development and small-to-medium training) or Bare Metal (full root access and system-level control, ideal for large-scale distributed training and custom CUDA configurations). Interconnect: Single-node instances use standard networking; multi-node clusters can use InfiniBand (up to 3.2 Tb/s, NDR / ConnectX-7) for low-latency GPU-to-GPU communication — essential for distributed training at 32+ GPUs.

Choosing between H100, H200, and B200? Use H100 for mainstream training, fine-tuning, and inference. Choose H200 for larger models, longer context, or memory-bound inference. Choose B200 for maximum per-GPU throughput on frontier-scale training and inference.

Need Help Deciding?

Talk to Sales

Get personalized recommendations and custom quotes for Reserved and Private Cloud

Contact Support

Get help from our support team

Get an Instant Quote

Price a reserved cluster in-app in minutes

Get Started

Launch your first GPU instance in minutes

Overview

GPUs

References

Account & Billing

Quick Comparison Table

Detailed Comparison

On-Demand GPUs

Reserved

Private Cloud

Available GPUs

Need Help Deciding?

Talk to Sales

Contact Support

Get an Instant Quote

Get Started

​Quick Comparison Table

​Detailed Comparison

​On-Demand GPUs

​Reserved

​Private Cloud

​Available GPUs

​Need Help Deciding?

Talk to Sales

Contact Support

Get an Instant Quote

Get Started

Quick Comparison Table

Detailed Comparison

On-Demand GPUs

Reserved

Private Cloud

Available GPUs

Need Help Deciding?