Question 1

What are GPU Pods and how do they differ from other cloud GPU offerings?

Accepted Answer

GPU Pods are dedicated GPU instances you can spin up on Runpod. Unlike abstracted serverless GPUs, Pods give you full control over the underlying VM, drivers, and environment. You get a persistent instance (or ephemeral, if you prefer) with direct access to powerful GPUs, letting you run training, inference, or other workloads exactly how you want.

Question 2

Which GPU models are available?

Accepted Answer

We offer 30+ GPU models, from entry-level inference cards to top-tier training accelerators. Examples include A100, H100, RTX 6000 Ada, L4/L40 series, and many more—over 30 options in total. You can pick any supported GPU when you launch a Pod, and new models roll out as soon as they’re live on the platform. For the latest availability, check the dashboard or query the API.

Question 3

How is pricing structured?

Accepted Answer

Pricing is shown as an hourly rate but billed by the millisecond. You only pay for the exact time your Pod runs—if you start and stop a Pod in one minute, you’re charged just that minute. Storage volumes may incur minimal fees when attached, but compute costs are metered by the millisecond.

Question 4

Can I bring my own Docker container or environment?

Accepted Answer

Yes. GPU Pods support custom Docker images. You can build an image with your preferred libraries and push it to a registry (Docker Hub, ECR, etc.), then reference it when you launch the Pod. That way you control the OS, drivers, and dependencies.

Question 5

Which frameworks and runtimes are supported?

Accepted Answer

Any framework that runs on Linux and supports GPUs: PyTorch, TensorFlow, JAX, ONNX, CUDA toolkits, etc. Since you control the container, you can install whatever versions or additional tools you need (e.g., NCCL, Horovod). We provide base images with common ML stacks to speed up setup.

Question 6

What about spot/preemptible GPUs?

Accepted Answer

We offer spot instances where GPU capacity is available at a discount, but with the risk of eviction when demand spikes. You can use them for fault-tolerant or batch workloads. The UI/API will indicate current spot availability and pricing.

Deploy any GPU. Any framework. Under 30 seconds.

30-second deploys

31 global regions

Per-second billing

30+ GPU models. 31 global regions.

Built-in developer tools & integrations.

Full API access.

CLI & SDKs.

GitHub & CI/CD.

Persistent storage. No ingress fees. No egress fees.

Gain additional savings with reservations.

Questions? Answers.

750,000 developers chose Runpod without a sales call.

Your first GPU pod is free.