Runpod Blog

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

Brendan McKeag

24 October 2024

Stable Diffusion 3.5: What’s New in the Latest Generation

Stability.ai’s SD3.5 is here—with new models built for speed and quality. Learn what’s changed, what’s improved, and how to run it on Runpod.

Read article

Product Updates

Brendan McKeag

18 October 2024

Why NVidia's Llama 3.1 Nemotron 70B Might Be the Most Reasonable LLM Yet

NVidia’s Llama 3.1 Nemotron 70B is outperforming larger and closed models on key reasoning tasks. In this post, Brendan tests it against a long-unsolved challenge: consistent, in-character roleplay with zero internal monologue or user coercion—and finds it finally up to the task.

Read article

AI Workloads

Brendan McKeag

18 October 2024

NVIDIA's Llama 3.1 Nemotron 70B: Can It Solve Your LLM Bottlenecks?

Nemotron 70B is NVIDIA’s latest open model and it’s climbing the leaderboards. But how does it perform in the real world—and can it solve your toughest inference challenges?

Read article

Hardware & Trends

Brendan McKeag

14 October 2024

How to Code Stable Diffusion Directly in Python on RunPod

Skip the front ends—learn how to use Jupyter Notebook on RunPod to run Stable Diffusion directly in Python. Great for devs who want full control.

Read article

AI Workloads

Brendan McKeag

01 October 2024

Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases

Large language models can write poetry and solve logic puzzles—but fail at tasks like counting letters or doing math. Here’s why, and what it tells us about their design.

Read article

Learn AI

Brendan McKeag

25 September 2024

Run GGUF Quantized Models Easily with KoboldCPP on Runpod

Lower VRAM usage and improve inference speed using GGUF quantized models in KoboldCPP with just a few environment variables.

Read article

AI Workloads

Brendan McKeag

25 September 2024

How to Work with GGUF Quantizations in KoboldCPP

GGUF quantizations make large language models faster and more efficient. This guide walks you through using KoboldCPP to load, run, and manage quantized LLMs on Runpod.

Read article

Learn AI

Stable Diffusion 3.5: What’s New in the Latest Generation

Why NVidia's Llama 3.1 Nemotron 70B Might Be the Most Reasonable LLM Yet

NVIDIA's Llama 3.1 Nemotron 70B: Can It Solve Your LLM Bottlenecks?

How to Code Stable Diffusion Directly in Python on RunPod

Why LLMs Can't Spell 'Strawberry' And Other Odd Use Cases

Run GGUF Quantized Models Easily with KoboldCPP on Runpod

How to Work with GGUF Quantizations in KoboldCPP

Build what’s next.