Announcing Runpod Flash

Brendan McKeag

Runpod Achieves SOC 2 Type II Certification: Continuing Our Compliance Journey
Brendan McKeag
October 13, 2025

Runpod Achieves SOC 2 Type II Certification: Continuing Our Compliance Journey

Runpod has officially achieved SOC 2 Type II certification, validating that its enterprise-grade security controls not only meet strict design standards but also operate effectively over time. This milestone proves Runpod’s ongoing commitment to protecting customer data and maintaining trusted, compliant AI infrastructure for enterprises and developers alike.

Product Updates
All
Setting up Slurm on Runpod Clusters: A Technical Guide
Brendan McKeag
September 25, 2025

Setting up Slurm on Runpod Clusters: A Technical Guide

Slurm on Runpod Clusters makes it simple to scale distributed AI and scientific computing across multiple GPU nodes. With pre-configured setup, advanced job scheduling, and built-in monitoring, users can efficiently manage training, batch processing, and HPC workloads while testing connectivity, CUDA availability, and multi-node PyTorch performance.

AI Infrastructure
All
DeepSeek V3.1: A Technical Analysis of Key Changes from V3-0324
Brendan McKeag
August 25, 2025

DeepSeek V3.1: A Technical Analysis of Key Changes from V3-0324

DeepSeek V3.1 introduces a breakthrough hybrid reasoning architecture that dynamically toggles between fast inference and deep chain-of-thought logic using token-controlled templates—enhancing performance, flexibility, and hardware efficiency over its predecessor V3-0324. This update positions V3.1 as a powerful foundation for real-world AI applications, with benchmark gains across math, code, and agent tasks, now fully deployable on Runpod Clusters.

AI Workloads
All
Wan 2.2 Releases With a Plethora Of New Features
Brendan McKeag
August 1, 2025

Wan 2.2 Releases With a Plethora Of New Features

Deploy Wan 2.2 on Runpod to unlock next-gen video generation with Mixture-of-Experts architecture, TI2V-5B support, and 83% more training data—run text-to-video and image-to-video models at scale using A100–H200 GPUs and customizable ComfyUI workflows.

AI Infrastructure
All
Deep Cogito Releases Suite of LLMs Trained with Iterative Policy Improvement
Brendan McKeag
August 1, 2025

Deep Cogito Releases Suite of LLMs Trained with Iterative Policy Improvement

Deploy DeepCogito’s Cogito v2 models on Runpod to experience frontier-level reasoning at lower inference costs—choose from 70B to 671B parameter variants and leverage Runpod’s optimized templates and Clusters for scalable, efficient AI deployment.

AI Infrastructure
All
How to Run MoonshotAI’s Kimi-K2-Instruct on Runpod Instant Cluster
Brendan McKeag
July 25, 2025

How to Run MoonshotAI’s Kimi-K2-Instruct on Runpod Instant Cluster

Run MoonshotAI’s Kimi-K2-Instruct on Runpod Clusters using H200 SXM GPUs and a 2TB shared network volume for seamless multi-node training. This guide shows how to deploy with PyTorch templates, optimize Docker environments, and accelerate LLM inference with scalable, low-latency infrastructure.

AI Workloads
All
Comparing the 5090 to the 4090 and B200: How Does It Stack Up?
Brendan McKeag
July 25, 2025

Comparing the 5090 to the 4090 and B200: How Does It Stack Up?

Benchmark Qwen2.5-Coder-7B-Instruct across NVIDIA’s B200, RTX 5090, and 4090 to identify optimal GPUs for LLM inference—compare token throughput, cost per token, and memory efficiency to match your workload with the right performance tier.

All
Poddy mascot displayed as a retro TV with static, indicating no results found
We couldn't find anything. Try a different search.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.