Marut Pandya

Boost vLLM Performance on Runpod with GuideLLM

Marut Pandya

September 10, 2024

Boost vLLM Performance on Runpod with GuideLLM

Learn how to use GuideLLM to simulate real-world inference loads, fine-tune performance, and optimize cost for vLLM deployments on Runpod.

AI Workloads

AMD MI300X vs. Nvidia H100 SXM: Performance Comparison on Mixtral 8x7B Inference

Marut Pandya

July 1, 2024

AMD MI300X vs. Nvidia H100 SXM: Performance Comparison on Mixtral 8x7B Inference

Runpod benchmarks AMD's MI300X against Nvidia's H100 SXM using Mistral's Mixtral 8x7B model. The results highlight performance and cost trade-offs across.

AMD MI300X vs. NVIDIA H100: Mixtral 8x7B Inference Benchmark

Marut Pandya

July 1, 2024

AMD MI300X vs. NVIDIA H100: Mixtral 8x7B Inference Benchmark

We benchmarked AMD’s MI300X against NVIDIA’s H100 on Mixtral 8x7B. Discover which GPU delivers faster inference and better performance-per-dollar.

Poddy mascot displayed as a retro TV with static, indicating no results found

We couldn't find anything. Try a different search.

Build what’s next.

Build, train, and scale AI workloads on Runpod with cloud GPUs, Serverless, and Clusters.

Get started