
H100 PCIe
High-performance data center GPU based on Hopper architecture with 80GB HBM2e memory and 14,592 CUDA cores for AI training, machine learning, and enterprise workloads.
GPU Benchmarks
Compare performance across LLMs and image models to find the best GPU for your workload.

Benchmarks were run using vLLM in May 2025 on Runpod GPUs.

Dual-GPU data center accelerator based on Hopper architecture with 188GB combined HBM3 memory (94GB per GPU) designed specifically for LLM inference and deployment.

High-efficiency LLM processing at 90.98 tok/s.
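To put the 90.98 tok/s figure in practical terms, the short sketch below estimates how long a response of a given length would take at that rate. It assumes the figure is a steady per-request decode rate and ignores prompt processing and batching, which the page does not specify. The function name and the response lengths are illustrative only.

```python
def seconds_for_tokens(num_tokens: int, tokens_per_second: float) -> float:
    """Estimate generation time at a constant decode rate (an assumption)."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


# Throughput quoted on this page; response lengths are hypothetical.
QUOTED_TOK_PER_S = 90.98

for length in (256, 1024, 4096):
    estimate = seconds_for_tokens(length, QUOTED_TOK_PER_S)
    print(f"{length} tokens: about {estimate:.1f} s")
```

Real latency will differ with prompt length, concurrency, and model size, so treat these as rough lower bounds.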
Benchmarks were run using Hugging Face Diffusers in May 2025 on Runpod GPUs.

Unmatched image generation speed at 49.9 images per minute.

AI image processing at 40.3 images per minute.

Pro-grade performance with 36 images per minute.
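The images-per-minute figures above can be easier to compare as time per image. The sketch below does that conversion for the three quoted rates. The page does not state which settings (model, resolution, step count) produced each figure, so the code keys the results by the quoted number rather than by a configuration name.

```python
def seconds_per_image(images_per_minute: float) -> float:
    """Convert a throughput in images per minute to seconds per image."""
    if images_per_minute <= 0:
        raise ValueError("images_per_minute must be positive")
    return 60.0 / images_per_minute


# Rates quoted on this page, in images per minute.
quoted_rates = [49.9, 40.3, 36.0]

for rate in quoted_rates:
    print(f"{rate} images/min is about {seconds_per_image(rate):.2f} s per image")
```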
Case Studies