Cloud GPUs

Rent a cloud GPU. Any model. Under 30 seconds.

On-demand GPU rental with per-second billing. 30+ GPU models across 31 global regions. No minimums, no egress fees, no idle waste.

Get started

30-second GPU deploys

Spin up any GPU instance in under 30 seconds — no provisioning queues, no sales calls, no wait.

31 global regions

Rent GPU instances in 31 regions across the US, Europe, Asia, and Australia. Deploy where your users are, not where inventory is.

Per-second GPU billing

GPU rental billed by the second. No egress fees, no minimums, no surprises. Run a job for 3 minutes — pay for 3 minutes.

Trusted by top engineers at the world's leading companies.

GPU Pricing

30+ GPU models to rent. 31 global regions.

On-demand GPU cloud pricing with no long-term commitments. Rent by the second or lock in savings with reservations.

Get started

GPU

Community Cloud

Secure Cloud

Per hour

Per second

>80GB VRAM

H200

141 GB VRAM

276 GB RAM

vCPUs

$4.39/hr

B200

180 GB VRAM

283 GB RAM

vCPUs

$5.89/hr

RTX Pro 6000

96 GB VRAM

188 GB RAM

vCPUs

$2.09/hr

H100 NVL

94 GB VRAM

94 GB RAM

vCPUs

$3.19/hr

80GB VRAM

H100 PCIe

80 GB VRAM

188 GB RAM

vCPUs

$2.89/hr

H100 SXM

80 GB VRAM

125 GB RAM

vCPUs

$3.29/hr

A100 PCIe

80 GB VRAM

117 GB RAM

vCPUs

$1.39/hr

A100 SXM

80 GB VRAM

125 GB RAM

vCPUs

$1.49/hr

48GB VRAM

L40S

48 GB VRAM

94 GB RAM

vCPUs

$0.86/hr

RTX 6000 Ada

48 GB VRAM

167 GB RAM

vCPUs

$0.77/hr

A40

48 GB VRAM

50 GB RAM

vCPUs

$0.44/hr

L40

48 GB VRAM

94 GB RAM

vCPUs

$0.99/hr

RTX A6000

48 GB VRAM

50 GB RAM

vCPUs

$0.49/hr

32GB VRAM

RTX 5090

32 GB VRAM

35 GB RAM

vCPUs

$0.99/hr

24GB VRAM

24 GB VRAM

50 GB RAM

vCPUs

$0.39/hr

RTX 3090

24 GB VRAM

125 GB RAM

vCPUs

$0.46/hr

RTX 4090

24 GB VRAM

41 GB RAM

vCPUs

$0.69/hr

RTX A5000

24 GB VRAM

25 GB RAM

vCPUs

$0.27/hr

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Use Cases

GPU cloud instances for every workload

LLM Inference

Rent H100 or L40S instances for low-latency inference. Deploy vLLM, TGI, or a custom container in seconds.

Model Training & Fine-Tuning

A100 and H100 SXM GPU rentals built for long training runs. Persistent storage, multi-GPU support, no babysitting required.

AI Agents & Automation

On-demand GPU instances that spin up fast and scale with your workload. No cold starts, no idle costs.

Image & Video Generation

RTX 4090 and RTX A6000 rentals for image gen, video synthesis, and diffusion workflows.

"The Runpod team has clearly prioritized the developer experience to create an elegant solution that enables individuals to rapidly develop custom AI apps or integrations while also paving the way for organizations to truly deliver on the promise of AI."

Amjad Masad

"Runpod is the only place I can deploy high-end GPU models instantly—no sales calls, no rate limits, no nonsense."

Daniel Chang

“The main value proposition for us was the flexibility Runpod offered. We were able to scale up effortlessly to meet the demand at launch.”

Josh Payne

“Runpod helped us scale the part of our platform that drives creation. That’s what fuels the rest—image generation, sharing, remixing. It starts with training.”

Matty Shimura

Developer Tools

Built-in tools for managing your GPU rental at scale.

Manage every GPU instance from code. Runpod works in your terminal, your CI pipeline, and your deployment scripts.

Get started

Full API access.

Automate everything with a simple, flexible API.

CLI & SDKs.

Deploy and manage directly from your terminal.

GitHub & CI/CD.

Push to main, trigger builds, and deploy in seconds.

Storage Pricing

Persistent storage for your GPU cloud instances. No ingress or egress fees.

No fees for ingress/egress. Persistent and temporary storage available.

Pod Pricing

Storage Type

Running Pods

Idle Pods

Volume

$0.10/GB/mo

$0.20/GB/mo

Container Disk

$0.10/GB/mo

Persistent Network Storage

Storage Type

Under 1TB

Over 1TB

Network Volume

$0.07/GB/mo

$0.05/GB/mo

Save more with long-term GPU cloud commitments.

Prefer renting GPU capacity on a reserved basis? Get discounted rates on active and flex instances — no per-second premium, just predictable cloud GPU pricing.

FAQs

Questions? Answers.

Curious about unlocking GPU power in the cloud? Get clear answers to accelerate your projects with on-demand high-performance compute.

What are GPU Pods and how do they differ from other cloud GPU offerings?

GPU Pods are dedicated GPU instances you can spin up on Runpod. Unlike abstracted serverless GPUs, Pods give you full control over the underlying VM, drivers, and environment. You get a persistent instance (or ephemeral, if you prefer) with direct access to powerful GPUs, letting you run training, inference, or other workloads exactly how you want.

Which GPU models are available?

We offer 30+ GPU models, from entry-level inference cards to top-tier training accelerators. Examples include A100, H100, RTX 6000 Ada, L4/L40 series, and many more—over 30 options in total. You can pick any supported GPU when you launch a Pod, and new models roll out as soon as they’re live on the platform. For the latest availability, check the dashboard or query the API.

How is pricing structured?

Pricing is shown as an hourly rate but billed by the millisecond. You only pay for the exact time your Pod runs—if you start and stop a Pod in one minute, you’re charged just that minute. Storage volumes may incur minimal fees when attached, but compute costs are metered by the millisecond.

Can I bring my own Docker container or environment?

Yes. GPU Pods support custom Docker images. You can build an image with your preferred libraries and push it to a registry (Docker Hub, ECR, etc.), then reference it when you launch the Pod. That way you control the OS, drivers, and dependencies.

Which frameworks and runtimes are supported?

Any framework that runs on Linux and supports GPUs: PyTorch, TensorFlow, JAX, ONNX, CUDA toolkits, etc. Since you control the container, you can install whatever versions or additional tools you need (e.g., NCCL, Horovod). We provide base images with common ML stacks to speed up setup.

What about spot/preemptible GPUs?

We offer spot instances where GPU capacity is available at a discount, but with the risk of eviction when demand spikes. You can use them for fault-tolerant or batch workloads. The UI/API will indicate current spot availability and pricing.

How do I rent a GPU on Runpod?

Sign up, go to [Pods → Deploy], select your GPU model (H100, A100, RTX 4090, and 30+ others), choose a region, attach a container image, and click Deploy. Your GPU instance is live in under 30 seconds. Billing starts the moment it's running and stops the moment you terminate it — billed by the second, never by the hour.

What's the cheapest GPU rental on Runpod?

RTX A5000s start at $0.27/hr and L4s at $0.39/hr — both solid choices for inference, testing, and development. If you need more VRAM, A40s are $0.44/hr and RTX 3090s are $0.46/hr. All GPU rentals are billed by the second, so a short job costs a fraction of an hour's rate.

Can I rent H100 or A100 GPUs on-demand without a contract?

Yes. H100 PCIe instances are available on-demand from $2.89/hr and A100 PCIe from $1.39/hr, with no contracts or minimums. If you need guaranteed capacity at lower rates, ask about our reserved pricing. Either way, you're billed by the second — not by the hour or day.

Clients

750,000 developers chose Runpod without a sales call.

Engineered for teams building the future.

Start your GPU rental in 30 seconds.

No credit card required to explore. No minimums once you deploy. Just fast, on-demand GPU cloud access.

Get started