Announcing Runpod Flash

Runpod Blog.

Our team’s insights on building better
and scaling smarter.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Mixture of Experts (MoE): A Scalable AI Training Architecture
Alyssa Mazzina
April 23, 2025

Mixture of Experts (MoE): A Scalable AI Training Architecture

MoE models scale efficiently by activating only a subset of parameters. Learn how this architecture works, why it’s gaining traction, and how Runpod supports MoE training and inference.

AI Workloads
All
Runpod Global Networking Expands to 14 More Data Centers
Brendan McKeag
April 22, 2025

Runpod Global Networking Expands to 14 More Data Centers

Runpod’s global networking feature is now available in 14 new data centers, improving latency and accessibility across North America, Europe, and Asia.

AI Infrastructure
All
How to Fine-Tune LLMs with Axolotl on Runpod
James Sandy
April 21, 2025

How to Fine-Tune LLMs with Axolotl on Runpod

Learn how to fine-tune large language models using Axolotl on Runpod. This guide covers LoRA, 8-bit quantization, DeepSpeed, and GPU infrastructure setup.

AI Workloads
All
The RTX 5090 Is Here: Serve 65,000+ Tokens Per Second on Runpod
Alyssa Mazzina
April 15, 2025

The RTX 5090 Is Here: Serve 65,000+ Tokens Per Second on Runpod

The new NVIDIA RTX 5090 is now live on Runpod. With blazing-fast inference speeds and large memory capacity, it’s ideal for real-time LLM workloads and AI scaling.

AI Workloads
All
Cost-Effective AI with Autoscaling on Runpod
James Sandy
April 14, 2025

Cost-Effective AI with Autoscaling on Runpod

Learn how Runpod autoscaling helps teams cut costs and improve performance for both training and inference. Includes best practices and real-world efficiency gains.

AI Workloads
All
The Future of AI Training: Are GPUs Enough?
Alyssa Mazzina
April 10, 2025

The Future of AI Training: Are GPUs Enough?

GPUs still dominate AI training in 2025, but emerging hardware and hybrid infrastructure are reshaping what's possible. Here’s what GTC revealed—and what it means for you.

AI Workloads
All
Llama 4 Scout and Maverick Are Here—How Do They Shape Up?
Brendan McKeag
April 9, 2025

Llama 4 Scout and Maverick Are Here—How Do They Shape Up?

Meta’s Llama 4 models, Scout and Maverick, are the next evolution in open LLMs. This post explores their strengths, performance, and deployment on Runpod.

All
Poddy mascot displayed as a retro TV with static, indicating no results found
We couldn't find anything. Try a different search.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.