
Compare GPU Performance on AI Workloads

A40 vs. RTX 2000 Ada

LLM Benchmarks

Benchmarks were run on RunPod GPUs using vLLM. For more details on vLLM, see the vLLM GitHub repository.

Metric | Model | Tokens | Batch Size | Output Throughput (tok/s)
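
As a rough illustration of the Output Throughput (tok/s) column, the metric can be read as the number of generated tokens divided by wall-clock generation time, summed across the batch. The sketch below assumes that definition. The function name `output_throughput` and its parameters are hypothetical and are not taken from RunPod's benchmark harness or from vLLM.

```python
from typing import Sequence


def output_throughput(tokens_per_request: Sequence[int], elapsed_seconds: float) -> float:
    """Return generated tokens per second for one batch.

    Assumption: throughput = total output tokens across the batch
    divided by the wall-clock time taken to generate them.
    """
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    # Sum output tokens over every request in the batch.
    total_tokens = sum(tokens_per_request)
    return total_tokens / elapsed_seconds


if __name__ == "__main__":
    # Illustrative numbers only, not measured results:
    # a batch of 4 requests, 128 output tokens each, finished in 2 seconds.
    print(output_throughput([128, 128, 128, 128], 2.0))
```

Under this definition a larger batch size raises total throughput as long as generation time grows more slowly than the token count, which is why batch size is reported alongside the metric.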


Copyright © 2025. All rights reserved.