Brendan McKeag

Runpod RoundUp 2 – 32k Token Context LLMs and New StabilityAI Offerings

This week’s Runpod RoundUp covers major releases including Llama 2 with 32k context support, SDXL 1.0’s public release, and StabilityAI’s new Stable Beluga LLMs, all now available to run on Runpod.
AI Workloads

Stable Diffusion XL 1.0 Released And Available On Runpod

Stable Diffusion XL 1.0 is now live on Runpod with full support in the Fast Stable Diffusion template. Users can generate higher-resolution, more anatomically accurate images, including images with legible text, from simpler prompts, using AUTOMATIC1111 through a streamlined Jupyter setup.
AI Workloads

Runpod Roundup: High-Context LLMs, SDXL, and Llama 2

This Runpod Roundup covers the arrival of 8k–16k token context models, the release of Stable Diffusion XL, and the launch of Llama 2 by Meta and Microsoft. All are now available to run on Runpod.
Hardware & Trends

Meta and Microsoft Release Llama 2 as Open Source

Llama 2 is now open source, offering a native 4k context window and strong performance. This post walks through how to download it from Meta or use TheBloke’s quantized versions.
Hardware & Trends

How to Install SillyTavern in a Runpod Instance

This guide walks through setting up SillyTavern, a powerful, customizable roleplay frontend, on a Runpod instance. It covers port exposure, installation from GitHub, whitelist configuration, and connecting to backends like Oobabooga or KoboldAI.
Learn AI

16k Context LLM Models Now Available On Runpod

Runpod now supports Panchovix’s 16k-token context models, allowing for much deeper context retention in long-form generation. These models require higher VRAM and may trade off some performance, but are ideal for extended sessions like roleplay or complex Q&A.
Product Updates

Runpod Partners With Defined.ai To Democratize and Accelerate AI Development

Runpod announces a partnership with Defined.ai to offer ethically sourced speech and text datasets to AI developers, starting with a pilot program to fine-tune LLMs and accelerate NLP research.
Product Updates
