Announcing Runpod Flash

Deploy any GPU. Any framework. Under 30 seconds.

Any framework. Any region. Billed by the second -- stop paying for idle compute.

30-second deploys

Deploy any GPU environment in under 30 seconds. No provisioning wait.

31 global regions

Deploy in 31 regions across the US, Europe, Asia, and Australia. Pick what’s closest to your users.

Per-second billing

Billed by the second. No egress fees. No minimums. Your bill does what you'd expect.

Trusted by top engineers at the world's leading companies.

30+ GPU models. 31 global regions.

Simple pricing plans for teams of all sizes, designed to scale with you.

GPU

>80GB VRAM

H200

141 GB VRAM
276 GB RAM
24
vCPUs
$3.59/hr

B200

180 GB VRAM
283 GB RAM
28
vCPUs
$5.98/hr

RTX Pro 6000

96 GB VRAM
188 GB RAM
16
vCPUs
$1.69/hr

H100 NVL

94 GB VRAM
94 GB RAM
16
vCPUs
$2.59/hr
80GB VRAM

H100 PCIe

80 GB VRAM
188 GB RAM
16
vCPUs
$1.99/hr

H100 SXM

80 GB VRAM
125 GB RAM
20
vCPUs
$2.69/hr

A100 PCIe

80 GB VRAM
117 GB RAM
8
vCPUs
$1.19/hr

A100 SXM

80 GB VRAM
125 GB RAM
16
vCPUs
$1.39/hr
48GB VRAM

L40S

48 GB VRAM
94 GB RAM
16
vCPUs
$0.79/hr

RTX 6000 Ada

48 GB VRAM
167 GB RAM
10
vCPUs
$0.74/hr

A40

48 GB VRAM
50 GB RAM
9
vCPUs
$0.35/hr

L40

48 GB VRAM
94 GB RAM
8
vCPUs
$0.69/hr

RTX A6000

48 GB VRAM
50 GB RAM
9
vCPUs
$0.33/hr
32GB VRAM

RTX 5090

32 GB VRAM
35 GB RAM
9
vCPUs
$0.69/hr
24GB VRAM

L4

24 GB VRAM
50 GB RAM
12
vCPUs
$0.44/hr

RTX 3090

24 GB VRAM
125 GB RAM
16
vCPUs
$0.22/hr

RTX 4090

24 GB VRAM
41 GB RAM
6
vCPUs
$0.34/hr

RTX A5000

24 GB VRAM
25 GB RAM
9
vCPUs
$0.16/hr
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
"The Runpod team has clearly prioritized the developer experience to create an elegant solution that enables individuals to rapidly develop custom AI apps or integrations while also paving the way for organizations to truly deliver on the promise of AI."

Amjad Masad

"Runpod is the only place I can deploy high-end GPU models instantly—no sales calls, no rate limits, no nonsense."

Daniel Chang

“The main value proposition for us was the flexibility Runpod offered. We were able to scale up effortlessly to meet the demand at launch.”

Josh Payne

“Runpod helped us scale the part of our platform that drives creation. That’s what fuels the rest—image generation, sharing, remixing. It starts with training.”

Matty Shimura

Built-in developer tools & integrations.

Runpod works wherever you build — in code, in your terminal, in your pipeline.

Full API access.

Automate everything with a simple, flexible API.

CLI & SDKs.

Deploy and manage directly from your terminal.

GitHub & CI/CD.

Push to main, trigger builds, and deploy in seconds.

Persistent storage. No ingress fees. No egress fees.

No fees for ingress/egress. Persistent and temporary storage available.

Pod Pricing
Storage Type
Running Pods
Idle Pods
Volume
$0.10/GB/mo
$0.20/GB/mo
Container Disk
$0.10/GB/mo
NA
Persistent Network Storage
Storage Type
Under 1TB
Over 1TB
Network Volume
$0.07/GB/mo
$0.05/GB/mo

Gain additional savings
with reservations.

Save more with long-term commitments. Speak with our team to reserve discounted active and flex workers.

Questions? Answers.

Curious about unlocking GPU power in the cloud? Get clear answers to accelerate your projects with on-demand high-performance compute.

GPU Pods are dedicated GPU instances you can spin up on Runpod. Unlike abstracted serverless GPUs, Pods give you full control over the underlying VM, drivers, and environment. You get a persistent instance (or ephemeral, if you prefer) with direct access to powerful GPUs, letting you run training, inference, or other workloads exactly how you want.

We offer 30+ GPU models, from entry-level inference cards to top-tier training accelerators. Examples include A100, H100, RTX 6000 Ada, L4/L40 series, and many more—over 30 options in total. You can pick any supported GPU when you launch a Pod, and new models roll out as soon as they’re live on the platform. For the latest availability, check the dashboard or query the API.

Pricing is shown as an hourly rate but billed by the millisecond. You only pay for the exact time your Pod runs—if you start and stop a Pod in one minute, you’re charged just that minute. Storage volumes may incur minimal fees when attached, but compute costs are metered by the millisecond.

Yes. GPU Pods support custom Docker images. You can build an image with your preferred libraries and push it to a registry (Docker Hub, ECR, etc.), then reference it when you launch the Pod. That way you control the OS, drivers, and dependencies.

Any framework that runs on Linux and supports GPUs: PyTorch, TensorFlow, JAX, ONNX, CUDA toolkits, etc. Since you control the container, you can install whatever versions or additional tools you need (e.g., NCCL, Horovod). We provide base images with common ML stacks to speed up setup.

We offer spot instances where GPU capacity is available at a discount, but with the risk of eviction when demand spikes. You can use them for fault-tolerant or batch workloads. The UI/API will indicate current spot availability and pricing.

750,000 developers chose Runpod without a sales call.

Engineered for teams building the future.

Wix logo
Otovo logo
Scatter Lab logo
Abzu logo
Aneta logo
Perplexity logo
Replit logo
Civitai logo

Your first GPU pod is free.