Question 1

What is the difference between a GPU pod and an Cluster?

Accepted Answer

A GPU pod is a single instance with one or more GPUs within the same node. A Cluster consists of multiple nodes interconnected with high-speed networking, allowing for workloads that span across multiple machines. Clusters are ideal for large model inference and distributed training that exceeds the capacity of a single node.

Question 2

What is the minimum and maximum cluster size?

Accepted Answer

Anyone can access 2 nodes on-demand with up to 16 GPUs. To access larger clusters up to 8 nodes (64 GPUs), you'll need to request a spend limit increase.

Question 3

How is billing handled for Clusters?

Accepted Answer

Clusters are billed by the second, just like our regular GPU pods. You're only charged for the compute time you actually use, with no minimum commitments or upfront costs. When you're done with your work, simply terminate the cluster to stop billing.

Question 4

What network bandwidth is available between nodes?

Accepted Answer

Clusters deliver 1,600–3,200 Gbps east-west bandwidth via InfiniBand or RoCE v2, depending on configuration.

Question 5

Are Cluster networks isolated between customers?

Accepted Answer

Yes. Each cluster’s east-west fabric is tenant-isolated. We enforce robust L2/L3 segmentation and RDMA fabric partitioning (e.g., InfiniBand P_Keys or RoCE v2 VLAN/VXLAN, depending on site), so there’s no routable path between tenants. Your 1.6–3.2 Tbps inter-node bandwidth is dedicated to your Runpod cluster—no cross-tenant visibility or traffic bleed.

Question 6

What storage solutions are available for large models?

Accepted Answer

Runpod offers native Network Storage integration where available, providing a shared filesystem layer that can be utilized across all nodes in your cluster. This is ideal for storing large models ranging from tens to hundreds of gigabytes close to your computing resources.

Question 7

Can I connect my cluster to AWS?

Accepted Answer

Yes, you can establish connections between your Runpod cluster and AWS environment through application layer mTLS, enabling secure bridging of workloads between platforms.

Question 8

Do you support Kubernetes or other container orchestration tools?

Accepted Answer

Currently, Clusters are not compatible with Kubernetes. The cluster environment is managed by Runpod's native orchestration system, eliminating the need for additional container orchestration tools or CNI configuration.

Question 9

Can I run Slurm on Clusters?

Accepted Answer

Yes, Clusters fully support Slurm for workload management.

Question 10

Are there any minimum lease terms or contract requirements?

Accepted Answer

No, there are absolutely no minimum lease terms for Clusters. You have complete flexibility to deploy and terminate clusters as needed to support your workloads, with no long-term commitments or contract obligations.

3,200 Gbps Infiniband GPU Clusters

Clusters

Reserved Clusters

Clusters: Available instantly, no contract required.

Launch a cluster in minutes.

Run any docker workload.

Orchestrate with Slurm.

Stop your cluster at any time.

Launch in minutes.

Pay by the second.

Scale globally.

Reserved Clusters: Dedicated capacity for large-scale workloads.

Uptime guarantee

Secure by default

Scale to thousands
of GPUs

Trusted by today's leaders, built for tomorrow's pioneers.

Questions? Answers.

Build what’s next.