Announcing Runpod Flash

Runpod Blog.

Our team’s insights on building better
and scaling smarter.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Announcing Global Networking for Secure Pod-to-Pod Communication Across Data Centers
Brendan McKeag
December 2, 2024

Announcing Global Networking for Secure Pod-to-Pod Communication Across Data Centers

Runpod now supports secure internal communication between pods across data centers. With Global Networking enabled, your pods can talk to each other privately via .runpod.internal—no open ports required.

Product Updates
All
How Much Can a GPU Cloud Save You? A Cost Breakdown vs On-Prem Clusters
James Sandy
November 22, 2024

How Much Can a GPU Cloud Save You? A Cost Breakdown vs On-Prem Clusters

We crunched the numbers: deploying 4x A100s on Runpod’s GPU cloud can save over $124,000 versus an on-prem cluster across 3 years. Learn why cloud beats on-prem for flexibility, cost, and scale.

Cost Optimization
All
Scoped API Keys Now Live: Secure, Fine-Grained Access Control on Runpod
Brendan McKeag
November 18, 2024

Scoped API Keys Now Live: Secure, Fine-Grained Access Control on Runpod

Runpod now supports scoped API keys with per-endpoint access, usage tracking, and on/off toggles. Create safer, more flexible keys that align with the principle of least privilege.

Product Updates
All
When to Use (or Not Use) Runpod's Proxy
Brendan McKeag
November 13, 2024

When to Use (or Not Use) Runpod's Proxy

Wondering when to use Runpod’s built-in proxy system for pod access? This guide breaks down its use cases, limitations, and when direct connection is a better choice.

AI Workloads
All
Quantization Methods Compared: Speed vs. Accuracy in Model Deployment
James Sandy
November 12, 2024

Quantization Methods Compared: Speed vs. Accuracy in Model Deployment

Explore the trade-offs between post-training, quantization-aware training, mixed precision, and dynamic quantization. Learn how each method impacts model speed, memory, and accuracy—and which is best for your deployment needs.

AI Workloads
All
How to Build and Deploy an AI Chatbot from Scratch with Runpod: A Community Project Breakdown
Brendan McKeag
November 6, 2024

How to Build and Deploy an AI Chatbot from Scratch with Runpod: A Community Project Breakdown

Explore how Code in a Jiffy built a fully functional AI-powered coffee shop chatbot using Runpod. This community spotlight covers agentic chatbot structures, full-stack architecture, and how Runpod’s serverless infra simplifies deployment.

Learn AI
All
Classifier-Free Guidance in LLMs: How It Works
Brendan McKeag
November 4, 2024

Classifier-Free Guidance in LLMs: How It Works

Classifier-Free Guidance improves LLM output quality and control. Here’s how it works, where it came from, and why it matters for your AI generations

Learn AI
All
Poddy mascot displayed as a retro TV with static, indicating no results found
We couldn't find anything. Try a different search.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.