Runpod AI Infrastructure Blog

Runpod product updates, AI infrastructure guides, GPU tutorials, and deployment patterns for developers building with cloud GPUs.

Moritz Wallawitsch

May 31, 2024

Introduction to vLLM and PagedAttention

Learn how vLLM achieves higher throughput than Hugging Face Transformers by using PagedAttention to eliminate memory waste and boost inference.

AI Workloads

How to Run vLLM on Runpod Serverless (Beginner-Friendly Guide)

Moritz Wallawitsch

May 31, 2024

Learn how to run vLLM on Runpod’s serverless GPU platform. This guide walks you through fast, efficient LLM inference without complex setup.

AI Infrastructure

Introducing Serverless CPU: High-Performance VMs Without GPUs

Brendan McKeag

May 28, 2024

Our new Serverless CPU offering lets you launch high-performance containers without GPUs, perfect for lighter workloads, dev tasks, and automation.

Product Updates

Announcing Runpod's New Serverless CPU Feature

Brendan McKeag

May 28, 2024

Runpod introduces Serverless CPU: high-performance VM containers with customizable CPU options, ideal for cost-effective and versatile workloads.

Product Updates

Enable SSH Password Authentication on a Runpod Pod

River Snow

May 16, 2024

Learn how to securely access your Runpod Pod using SSH with a username and password by configuring the SSH daemon and setting a root password.

Learn AI

Runpod's $20MM Milestone: Fueling Our Vision, Empowering Our Team

Zhen Lu

May 8, 2024

Runpod has raised $20MM in a funding round led by Intel Capital and Dell Technologies Capital, fueling our mission to power AI/ML cloud computing.

Product Updates

Refocusing on Core Strengths: The Shift from Managed AI APIs to Serverless Flexibility

Justin Merrell

April 29, 2024

Runpod is sunsetting Managed AI APIs to focus on Serverless, empowering users with greater control, flexibility, and streamlined infrastructure.

Product Updates
