Announcing Runpod Flash

Brendan McKeag

Streamline GPU Cloud Management with Runpod’s New REST API
Brendan McKeag
March 10, 2025

Streamline GPU Cloud Management with Runpod’s New REST API

Runpod’s new REST API lets you manage GPU workloads programmatically—launch, scale, and monitor pods without ever touching the dashboard.

AI Infrastructure
All
Enhanced CPU Pods Now Support Docker and Network Volumes
Brendan McKeag
March 3, 2025

Enhanced CPU Pods Now Support Docker and Network Volumes

We’ve upgraded Runpod CPU pods with Docker runtime and network volume support—giving you more flexibility, better storage options, and smoother dev workflows.

Product Updates
All
Run DeepSeek R1 on Just 480GB of VRAM
Brendan McKeag
February 27, 2025

Run DeepSeek R1 on Just 480GB of VRAM

DeepSeek R1 remains one of the top open-source models. This post shows how you can run it efficiently on just 480GB of VRAM without sacrificing performance.

AI Workloads
All
Intro to WebSocket Streaming with Runpod Serverless
Brendan McKeag
February 19, 2025

Intro to WebSocket Streaming with Runpod Serverless

This follow-up to our “Hello World” tutorial walks through streaming output from a Runpod Serverless endpoint using WebSocket and base64 files.

AI Infrastructure
All
How to Run a "Hello World" on Runpod Serverless
Brendan McKeag
February 6, 2025

How to Run a "Hello World" on Runpod Serverless

New to serverless? This guide shows you how to deploy a basic "Hello World" API on Runpod Serverless using Docker—perfect for beginners testing their first worker.

AI Infrastructure
All
Mistral Small 3 Avoids Synthetic Data—Why That Matters
Brendan McKeag
February 1, 2025

Mistral Small 3 Avoids Synthetic Data—Why That Matters

Mistral Small 3 skips synthetic data entirely and still delivers strong performance. Here’s why that decision matters, and what it tells us about future model development.

All
The Complete Guide to GPU Requirements for LLM Fine-Tuning
Brendan McKeag
January 29, 2025

The Complete Guide to GPU Requirements for LLM Fine-Tuning

Fine-tuning large language models can require hours or days of runtime. This guide walks through how to choose the right GPU spec for cost and performance.

All
Poddy mascot displayed as a retro TV with static, indicating no results found
We couldn't find anything. Try a different search.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.