Blog

Runpod AI Infrastructure Blog

Runpod product updates, AI infrastructure guides, GPU tutorials, and deployment patterns for developers building with cloud GPUs.

How to Code Stable Diffusion Directly in Python on Runpod

Brendan McKeag

October 14, 2024

How to Code Stable Diffusion Directly in Python on Runpod

Skip the front ends, learn how to use Jupyter Notebook on Runpod to run Stable Diffusion directly in Python. Great for devs who want full control.

AI Workloads

Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases

Brendan McKeag

October 1, 2024

Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases

Large language models can write poetry and solve logic puzzles, but fail at tasks like counting letters or doing math. Here's why, and what it tells us.

Learn AI

Run GGUF Quantized Models Easily with KoboldCPP on Runpod

Brendan McKeag

September 25, 2024

Run GGUF Quantized Models Easily with KoboldCPP on Runpod

Lower VRAM usage and improve inference speed using GGUF quantized models in KoboldCPP with just a few environment variables.

AI Workloads

How to Work with GGUF Quantizations in KoboldCPP

Brendan McKeag

September 25, 2024

How to Work with GGUF Quantizations in KoboldCPP

GGUF quantizations make large language models faster and more efficient. This guide walks you through using KoboldCPP to load, run, and manage quantized.

Learn AI

Introducing Better Forge: Spin Up Stable Diffusion Pods Faster

Brendan McKeag

September 20, 2024

Introducing Better Forge: Spin Up Stable Diffusion Pods Faster

Better Forge is a new Runpod template that lets you launch Stable Diffusion pods in less time and with less hassle. Here's how it improves your workflow.

AI Infrastructure