Runpod AI Infrastructure Blog

Runpod product updates, AI infrastructure guides, GPU tutorials, and deployment patterns for developers building with cloud GPUs.

Justin Merrell

October 12, 2023

Serverless | Migrating and Deploying Cog Images on Runpod Serverless from Replicate

A step-by-step guide to migrating a Cog image from Replicate to a Runpod Serverless endpoint using Docker and the cog-worker repo.

AI Workloads

Brendan McKeag

October 10, 2023

Use alpha_value To Blast Through Context Limits in LLaMa-2 Models

Learn how to extend the context length of LLaMa-2 models beyond their defaults using alpha_value and NTK-aware RoPE scaling, all without sacrificing coherency.

AI Workloads

Brendan McKeag

October 8, 2023

GPU-Powered AI Transformation Fireside Chat

Join Runpod CEO Zhen Lu and Data Science Dojo CEO Raja Iqbal on October 11 for a live fireside chat about GPU-powered AI transformation and the future of...

Brendan McKeag

October 1, 2023

How to Manage Funding Your Runpod Account

This guide breaks down everything you need to know about billing on Runpod: how credits are applied, what gets charged, and how to set up automatic or...

Cost Optimization

Brendan McKeag

September 22, 2023

Runpod and RandomSeed Bring Stable Diffusion API Access

Runpod partners with RandomSeed to power easy-to-use API access for Stable Diffusion through AUTOMATIC1111, making generative art more accessible to developers.

Product Updates

Brendan McKeag

September 20, 2023

Runpod Partners with Data Science Dojo To Provide Compute For LLM Bootcamps

Runpod has partnered with Data Science Dojo to power their Large Language Model bootcamps, providing scalable GPU infrastructure to support hands-on...

Product Updates

Pardeep Singh

September 8, 2023

Runpod Serverless Pricing Update

Runpod introduces new Serverless pricing with Flex and Active worker types, offering better scalability and up to 40% lower costs for consistent workloads.

Product Updates
