Blog

Runpod Blog

Our team’s insights on building better and scaling smarter.
All
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Stable Diffusion 3.5: What’s New in the Latest Generation

Stability.ai’s SD3.5 is here—with new models built for speed and quality. Learn what’s changed, what’s improved, and how to run it on Runpod.
Read article
Product Updates

Why NVidia's Llama 3.1 Nemotron 70B Might Be the Most Reasonable LLM Yet

NVidia’s Llama 3.1 Nemotron 70B is outperforming larger and closed models on key reasoning tasks. In this post, Brendan tests it against a long-unsolved challenge: consistent, in-character roleplay with zero internal monologue or user coercion—and finds it finally up to the task.
Read article
AI Workloads

NVIDIA's Llama 3.1 Nemotron 70B: Can It Solve Your LLM Bottlenecks?

Nemotron 70B is NVIDIA’s latest open model and it’s climbing the leaderboards. But how does it perform in the real world—and can it solve your toughest inference challenges?
Read article
Hardware & Trends

How to Code Stable Diffusion Directly in Python on RunPod

Skip the front ends—learn how to use Jupyter Notebook on RunPod to run Stable Diffusion directly in Python. Great for devs who want full control.
Read article
AI Workloads

Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases

Large language models can write poetry and solve logic puzzles—but fail at tasks like counting letters or doing math. Here’s why, and what it tells us about their design.
Read article
Learn AI

Run GGUF Quantized Models Easily with KoboldCPP on Runpod

Lower VRAM usage and improve inference speed using GGUF quantized models in KoboldCPP with just a few environment variables.
Read article
AI Workloads

How to Work with GGUF Quantizations in KoboldCPP

GGUF quantizations make large language models faster and more efficient. This guide walks you through using KoboldCPP to load, run, and manage quantized LLMs on Runpod.
Read article
Learn AI

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.