.jpeg)
Multi-Instance GPUs on Runpod: Stop Paying for Compute You Don't Need
With MIG, we can partition RTX 6000 Pro cards into isolated 24 GB instances. Here's when it makes sense for your workloads.
Blog
Our new Serverless CPU offering lets you launch high-performance containers without GPUs—perfect for lighter workloads, dev tasks, and automation.

We are thrilled to introduce the latest addition to the Runpod platform: Serverless CPU. This feature allows you to create high-performance VM containers with up to 3.75 GHz deviated cores, DDR5 memory, and NVME SSD storage.
With Serverless CPU, you have the flexibility to choose between Compute-Optimized or General Purpose CPUs and various configurations to suit your specific needs. Whether you require high compute power for intensive tasks or a balanced setup for general purposes, we've got you covered.
Getting Started:
Why Choose Serverless CPU over Serverless GPU?
While GPUs are well-known for their exceptional performance in parallel processing tasks, such as deep learning and complex computations, there are scenarios where CPUs are more suitable. Here are a few reasons why you might choose Serverless CPU over Serverless GPU:
Embrace the power of Runpod's Serverless CPU and elevate your application's performance today!
For a step-by-step guide on how to get started, check out our tutorial: Run an Ollama Server on a Runpod CPU.
Start Up a CPU Serverless Endpoint
Blog Posts
.jpeg)
With MIG, we can partition RTX 6000 Pro cards into isolated 24 GB instances. Here's when it makes sense for your workloads.

How 1,100 researchers beat OpenAI's own baseline with 16 megabytes and 10 minutes.

Learn how to set up a real-world agentic system with our new Flash framework.