CLUSTER OPTIONS

Choose how you run your cluster

Start instantly or reserve dedicated capacity for long-term workloads.

On-Demand

Instant Clusters

Multi-node compute ready in minutes, with no contract required to get started.

Up to 64 H100/H200 GPUs
Available now, with more capacity by request.

InfiniBand + RoCE v2 networking
Near bare-metal NCCL performance, validated by SemiAnalysis.

Slurm pre-configured
Launch distributed workloads without building orchestration yourself.

Per-hour billing
No reservation required.

Deploy in minutes
Tear down anytime.

Deploy a Cluster

Reserved

Reserved Clusters

Dedicated capacity with predictable pricing and long-term support for sustained workloads.

10,000+ GPUs of reserved capacity
For larger training runs and sustained demand.

Single-tenant infrastructure
Isolated environments for teams that need supply certainty.

One-month minimum commitment
Built for workloads that need predictable access.

Committed pricing with volume discounts
Custom pricing for long-term planning.

SLA-backed uptime and dedicated support
Priority support for critical workloads.

Talk to Sales

Networking that doesn't bottleneck your workload

High-performance networking for distributed AI training.

InfiniBand networking

High-bandwidth, low-latency networking for distributed training.

RoCE v2 support

RDMA over Ethernet for flexible, high-performance workloads.

Inter-node connectivity

High-speed communication across nodes at scale.

Framework compatibility

Compatible with NCCL, MPI, DeepSpeed, Axolotl, and more.

Learn more

Operations and tooling built for distributed workloads

Tools and infrastructure designed for how teams actually run clusters.

Slurm-native orchestration

Run distributed workloads with built-in scheduling and resource management.

Cluster monitoring

Track GPU, memory, and disk usage from a single dashboard.

Dynamic node management

Add or scale nodes without rebuilding your cluster.

Shared storage volumes

Persistent storage accessible across nodes for large datasets and models.

SSH access to every node

Direct access for debugging, setup, and workflow control.

Container-native workflows

Bring your own Docker images and manage your full software stack.

Deep Cogito trained its 671B mixture-of-experts model on Runpod Instant Clusters, demonstrating the scale possible with distributed infrastructure on demand.

Deep Cogito

Foundation model training

The main value proposition for us was the flexibility Runpod offered. We were able to scale up effortlessly to meet the demand at launch.

Coframe

Production inference scaling

Runpod cluster networking delivered near bare-metal NCCL performance in third-party benchmarking.

SemiAnalysis

Independent performance benchmarking

Security and compliance for regulated workloads

Certifications and isolated infrastructure for teams with strict data requirements.

SOC 2 Type II

Certified for security, availability, and confidentiality.

HIPAA compliance

HIPAA-compliant environments available for regulated workloads.

GDPR compliance

Supports GDPR requirements for organizations operating in the EU.

Single-tenant infrastructure

Isolated environments for strict data governance and separation.

Flexible pricing for every stage of your workflow

Runpod Clusters support both on-demand and reserved capacity, giving teams a clear path from fast experimentation to committed infrastructure at scale.

Talk to sales

On-demand clusters available now
Spin up multi-node clusters with per-hour pricing and no long-term commitment.

Reserved capacity for sustained workloads
Secure dedicated infrastructure for larger training runs and predictable production demand.

Committed pricing with volume discounts
Reserved deployments include pricing structures designed for long-term capacity planning.

Built to scale from 64 to 10,000+ GPUs
Start with self-serve clusters or work with our team on dedicated single-tenant infrastructure.

What reserved capacity unlocks

Work directly with our team to design infrastructure, pricing, and support tailored to your production requirements.

Dedicated infrastructure

Single-tenant cluster infrastructure fully reserved for your workloads.

Predictable capacity

Secure a baseline GPU allocation with options to burst as demand increases.

Volume pricing

Access committed pricing and volume discounts for sustained workloads.

SLA-backed reliability

Uptime guarantees and contractual SLAs designed for production environments.

Dedicated support

Direct access to engineering support, escalation paths, and onboarding assistance.

Compliance & contracts

SOC 2, BAA, and DPA documentation, along with flexible contract structures.

Big workloads need dedicated infrastructure. We’ll build it with you.

Tell us about your workload and GPU requirements. Our team typically follows up within one business day.

Talk to sales

Deploy a cluster

Multi-node GPU clusters in the cloud. Deploy in minutes, not months.

Trusted by teams running production AI

Trusted by research teams and AI companies building at scale

Built for the workloads that push beyond a single node

Foundation model training

Fine-tuning at scale

Distributed inference

AI research

Simulation and HPC

Batch processing
