Brendan McKeag

Use alpha_value To Blast Through Context Limits in LLaMa-2 Models

Brendan McKeag

October 10, 2023

Use alpha_value To Blast Through Context Limits in LLaMa-2 Models

Learn how to extend the context length of LLaMa-2 models beyond their defaults using alpha_value and NTK-aware RoPE scaling—all without sacrificing coherency.

AI Workloads

Save the Date October 11th, 2:00 PM EST: Fireside Chat With Runpod CEO Zhen Lu And Data Science Dojo CEO Raja Iqbal On GPU-Powered AI Transformation

Brendan McKeag

October 8, 2023

Save the Date October 11th, 2:00 PM EST: Fireside Chat With Runpod CEO Zhen Lu And Data Science Dojo CEO Raja Iqbal On GPU-Powered AI Transformation

Join Runpod CEO Zhen Lu and Data Science Dojo CEO Raja Iqbal on October 11 for a live fireside chat about GPU-powered AI transformation and the future of scalable machine learning infrastructure.

How to Manage Funding Your Runpod Account

Brendan McKeag

October 1, 2023

How to Manage Funding Your Runpod Account

This guide breaks down everything you need to know about billing on Runpod—how credits are applied, what gets charged, and how to set up automatic or manual funding.

Cost Optimization

Runpod Partners With RandomSeed to Provide Accessible, User-Friendly Stable Diffusion API Access

Brendan McKeag

September 22, 2023

Runpod Partners With RandomSeed to Provide Accessible, User-Friendly Stable Diffusion API Access

Runpod partners with RandomSeed to power easy-to-use API access for Stable Diffusion through AUTOMATIC1111, making generative art more accessible to developers.

Product Updates

Runpod Partners with Data Science Dojo To Provide Compute For LLM Bootcamps

Brendan McKeag

September 20, 2023

Runpod Partners with Data Science Dojo To Provide Compute For LLM Bootcamps

Runpod has partnered with Data Science Dojo to power their Large Language Model bootcamps, providing scalable GPU infrastructure to support hands-on learning in generative AI, embeddings, orchestration frameworks, and deployment.

Product Updates

What You'll Need to Run Falcon 180B In a Pod

Brendan McKeag

September 7, 2023

What You'll Need to Run Falcon 180B In a Pod

Falcon-180B is the largest open-source LLM to date, requiring 400GB of VRAM to run unquantized. This post explores how to deploy it on Runpod with A100s, L40s, and quantized alternatives like GGUF for more accessible use.

AI Infrastructure

Runpod Roundup 5 – Visual/Language Comprehension, Code-Focused LLMs, and Bias Detection

Brendan McKeag

August 31, 2023

Runpod Roundup 5 – Visual/Language Comprehension, Code-Focused LLMs, and Bias Detection

his week’s roundup covers Alibaba’s vision-language model Qwen-VL, Meta’s new code-focused LLM Code Llama, and FACET—a benchmark for detecting bias in computer vision datasets.

AI Workloads

Poddy mascot displayed as a retro TV with static, indicating no results found

We couldn't find anything. Try a different search.

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.

Get started