
B200
Next-generation data center GPU based on Blackwell architecture that features 192GB of HBM3e memory with 8TB/s bandwidth, delivering up to 20 petaFLOPS of FP4 AI compute performance.
GPU Benchmarks
Compare performance across LLMs and image models to find the best GPU for your workload.

Benchmarks were run using vLLM in May 2025 with Runpod GPUs

Next-generation data center GPU based on Blackwell architecture that features 192GB of HBM3e memory with 8TB/s bandwidth, delivering up to 20 petaFLOPS of FP4 AI compute performance.

High-performance data center GPU based on Ampere architecture with 80GB HBM2e memory and 6,912 CUDA cores for AI training, machine learning, and high-performance computing workloads.

High-efficiency LLM processing at 90.98 tok/s.
Benchmarks were run using Hugging Face Diffusers in May 2025 on Runpod GPUs.

Unmatched image gen speed with 49.9 images per minute.

AI image processing at 40.3 images per minute.

Pro-grade performance with 36 images per minute.
Case Studies