NVIDIA's Llama 3.1 Nemotron 70B: Can It Solve Your LLM Bottlenecks?
Nemotron 70B is NVIDIA's latest open model, and it's climbing the leaderboards. But how does it perform in the real world, and can it solve your toughest inference challenges?
Run Quantized GGUF Models with KoboldCPP on RunPod
GGUF quantization makes large language models smaller, faster, and cheaper to run. This guide walks you through using KoboldCPP to load, run, and manage quantized LLMs on RunPod.
What’s New for Serverless LLM Usage in RunPod (2025 Update)
RunPod’s serverless platform continues to evolve—especially for LLM workloads. Learn what’s new in 2025 and how to make the most of fast, scalable deployments.
Deploy Your First "Hello World" API on RunPod Serverless
New to serverless? This guide shows you how to deploy a basic "Hello World" API on RunPod Serverless using Docker, perfect for beginners testing their first worker.
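A RunPod Serverless worker of this kind boils down to a single handler function that receives each request as a job dict and returns a JSON-serializable result. A minimal sketch, assuming the `runpod` SDK is installed in the worker's Docker image (the `name` input field is illustrative, not part of any fixed schema):

```python
# Minimal "Hello World" handler for a RunPod Serverless worker.
def handler(job):
    """Each request arrives as a job dict; the payload sits under job["input"]."""
    name = job.get("input", {}).get("name", "World")  # fall back if no name given
    return {"greeting": f"Hello, {name}!"}

# To run as a worker (inside the Docker image, where `runpod` is installed):
#   import runpod
#   runpod.serverless.start({"handler": handler})
```

Once deployed, sending `{"input": {"name": "Pod"}}` to the endpoint would return `{"greeting": "Hello, Pod!"}`.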
The Complete Guide to GPU Requirements for LLM Fine-Tuning
Fine-tuning large language models can take hours or days of GPU time. This guide walks through choosing the right GPU for the best balance of cost and performance.