L4 vs H100 PCIe

Compare performance across LLMs and image models to find the best GPU for your workload.

LLM inference benchmarks.

Benchmarks were run using vLLM in May 2025 on Runpod GPUs.

L4

Energy-efficient data center GPU based on Ada Lovelace architecture with 24GB GDDR6 memory and 7,424 CUDA cores for AI inference, video processing, and edge computing applications.

H100 PCIe

High-performance data center GPU based on Hopper architecture with 80GB HBM2e memory and 14,592 CUDA cores for AI training, machine learning, and enterprise workloads.

H100 PCIe

High-efficiency LLM processing at 90.98 tok/s.

Image generation benchmarks.

Benchmarks were run using Hugging Face Diffusers in May 2025 on Runpod GPUs.

H100 SXM

Unmatched image gen speed with 49.9 images per minute.

H100 NVL

H100 PCIe

AI image processing at 40.3 images per minute.

H100 PCIe

Pro-grade performance with 36 images per minute.
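
To make the throughput figures above easier to compare, they can be converted into an average time per image. The snippet below is a minimal sketch, not part of the benchmark itself: the helper name seconds_per_image is hypothetical, and it assumes throughput is steady so that time per image is simply 60 divided by images per minute.

```python
def seconds_per_image(images_per_minute: float) -> float:
    """Convert a throughput in images per minute to average seconds per image."""
    if images_per_minute <= 0:
        raise ValueError("throughput must be positive")
    return 60.0 / images_per_minute


# Throughput figures quoted in the image generation benchmarks above.
quoted_throughputs = [49.9, 40.3, 36.0]

for ipm in quoted_throughputs:
    # Assumes steady-state generation; warm-up and model load time are ignored.
    print(f"{ipm:>5.1f} images/min is about {seconds_per_image(ipm):.2f} s per image")
```

Dividing an hourly GPU price by images per hour in the same way would give a rough cost per image, which is often the more useful number when choosing between GPUs.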

Real-world GPU performance in action.

See how teams optimize cost and performance with the right GPU for their workloads.
Aneta

"Runpod has changed the way we ship because we no longer have to wonder if we have access to GPUs. We've saved probably 90% on our infrastructure bill, mainly because we can use bursty compute whenever we need it."

Gendo

"Runpod has allowed the team to focus more on the features that are core to our product and that are within our skill set, rather than spending time focusing on infrastructure, which can sometimes be a bit of a distraction."

Civit AI

"Runpod helped us scale the part of our platform that drives creation. That’s what fuels the rest—image generation, sharing, remixing. It starts with training."

Scatter Lab

"Runpod allowed us to reliably handle scaling from zero to over 1,000 requests per second in our live application."

InstaHeadshots

"Runpod has allowed us to focus entirely on growth and product development without us having to worry about the GPU infrastructure at all."

KRNL

"We could stop worrying about infrastructure and go back to building. That’s the real win."

Coframe

"The main value proposition for us was the flexibility Runpod offered. We were able to scale up effortlessly to meet the demand at launch."

Glam AI

"After migration, we were able to cut down our server costs from thousands of dollars per day to only hundreds."

Segmind

"Runpod’s scalable GPU infrastructure gave us the flexibility we needed to match customer traffic and model complexity, without overpaying for idle resources."

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.