Hot starts, batch inference, and what's next for Runpod Serverless. Webinar June 25.

Shaamil Karim

Run vLLM on Runpod Serverless: Deploy Open Source LLMs in Minutes
Shaamil Karim
June 22, 2025

Run vLLM on Runpod Serverless: Deploy Open Source LLMs in Minutes

Learn when to use open source vs. closed source LLMs, and how to deploy models like Llama 3 or Qwen3 with vLLM on Runpod Serverless for high-throughput inference.

AI Workloads
All
Deploy Google Gemma 7B with vLLM on Runpod Serverless
Shaamil Karim
August 22, 2024

Deploy Google Gemma 7B with vLLM on Runpod Serverless

Deploy Google’s Gemma 7B model using vLLM on Runpod Serverless in just minutes. Learn how to optimize for speed, scalability, and cost-effective AI inference.

AI Workloads
All
Run Llama 3.1 with vLLM on Runpod Serverless
Shaamil Karim
August 20, 2024

Run Llama 3.1 with vLLM on Runpod Serverless

Discover how to deploy Meta's Llama 3.1 using Runpod's new vLLM worker. This guide walks you through model setup, performance benefits, and step-by-step.

AI Infrastructure
All
How to Run the FLUX Image Generator with ComfyUI on Runpod
Shaamil Karim
August 13, 2024

How to Run the FLUX Image Generator with ComfyUI on Runpod

Step-by-step guide for deploying FLUX with ComfyUI on Runpod. Perfect for creators looking to generate high-quality AI images with ease.

Learn AI
All
Run the Flux Image Generator on Runpod (Full Setup Guide)
Shaamil Karim
August 8, 2024

Run the Flux Image Generator on Runpod (Full Setup Guide)

This guide walks you through deploying the Flux image generator on a GPU using Runpod. Learn how to clone the repo, configure your environment, and start.

AI Workloads
All
Run SAM 2 on a Cloud GPU with Runpod (Step-by-Step Guide)
Shaamil Karim
August 2, 2024

Run SAM 2 on a Cloud GPU with Runpod (Step-by-Step Guide)

Learn how to deploy Meta's Segment Anything Model 2 (SAM 2) on a Runpod GPU using Jupyter Lab. This guide walks through installing dependencies.

AI Workloads
All
Poddy mascot displayed as a retro TV with static, indicating no results found
We couldn't find anything. Try a different search.

Build what’s next.

Build, train, and scale AI workloads on Runpod with cloud GPUs, Serverless, and Clusters.