Runpod Blog.

Our team’s insights on building better and scaling smarter.

How to Install SillyTavern in a Runpod Instance
Brendan McKeag
July 20, 2023

This guide walks through setting up SillyTavern—a powerful, customizable roleplay frontend—on a Runpod instance. It covers port exposure, GitHub installation, whitelist config, and connecting to models like Oobabooga or KoboldAI.

Learn AI
How to Install SillyTavern in a Runpod Instance
Brendan McKeag
July 20, 2023

Want to upgrade from basic chat UIs? SillyTavern offers a more interactive interface for AI conversations. Here’s how to install it on your own Runpod instance.

Learn AI
16k Context LLM Models Now Available On Runpod
Brendan McKeag
July 19, 2023

Runpod now supports Panchovix’s 16k-token context models, allowing for much deeper context retention in long-form generation. These models require higher VRAM and may trade off some performance, but are ideal for extended sessions like roleplay or complex Q&A.

Product Updates
Runpod Partners With Defined.ai To Democratize and Accelerate AI Development
Brendan McKeag
July 18, 2023

Runpod announces a partnership with Defined.ai to offer ethically sourced speech and text datasets to AI developers, starting with a pilot program to fine-tune LLMs and accelerate NLP research.

Product Updates
How to Use 65B+ Language Models on Runpod
Brendan McKeag
July 10, 2023

Large language models like Guanaco 65B can run on Runpod with the right optimizations. Learn how to handle quantization, memory, and GPU sizing.

AI Infrastructure
SuperHot 8k Token Context Models Are Here For Text Generation
Brendan McKeag
July 7, 2023

New 8k context models from TheBloke—like WizardLM, Vicuna, and Manticore—allow longer, more immersive text generation in Oobabooga. With more room for character memory and story progression, these models enhance AI storytelling.

Worker | Local API Server Introduced with runpod-python 0.10.0
Justin Merrell
July 1, 2023

Starting with runpod-python 0.10.0, you can launch a local API server for testing your worker handler using --rp_serve_api. This feature improves the development workflow by letting you simulate interactive API requests before deploying to serverless.

Product Updates
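
The local API server workflow from the runpod-python 0.10.0 post can be illustrated with a small worker file. This is a minimal sketch, assuming the usual runpod-python pattern of passing a handler function to `runpod.serverless.start`; the handler body, the `name` input field, and the file name `worker.py` are hypothetical and not taken from the post.

```python
import sys


def handler(job):
    """Process one job; job["input"] carries the request payload.

    The "name" field is a made-up example input, used here only
    to show a payload flowing through the handler.
    """
    name = job.get("input", {}).get("name", "world")
    return {"greeting": f"Hello, {name}!"}


def start_worker():
    """Hand the handler to the runpod serverless runtime."""
    import runpod  # third-party package: pip install runpod

    runpod.serverless.start({"handler": handler})


# For this sketch, the worker only starts when the local-server flag is
# passed, so running or importing the file without it has no side effects.
# A deployed worker would normally call start_worker() unconditionally.
if __name__ == "__main__" and "--rp_serve_api" in sys.argv:
    start_worker()
```

Saved as `worker.py`, this could be launched with `python worker.py --rp_serve_api`, after which requests sent to the local server should reach `handler` much as they would on serverless. Because the handler is a plain function, it can also be called directly in a unit test without starting any server.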