James Sandy

How to Run Serverless AI and ML Workloads on Runpod

Learn how to train, deploy, and scale AI/ML models using Runpod Serverless. This guide covers real-world examples, deployment best practices, and how serverless is unlocking new possibilities like real-time video generation.
Read article
Product Updates

How Much Can a GPU Cloud Save You? A Cost Breakdown vs On-Prem Clusters

We crunched the numbers: deploying 4x A100s on Runpod’s GPU cloud can save over $124,000 versus an on-prem cluster over three years. Learn why the cloud beats on-prem on flexibility, cost, and scale.
Read article
Cost Optimization
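The cloud-vs-on-prem comparison above comes down to simple arithmetic: pay-per-use hours against up-front hardware plus ongoing operations. The sketch below shows the shape of that calculation only — every figure in it is a hypothetical placeholder, not actual Runpod pricing or A100 hardware cost, so substitute your own numbers before drawing conclusions.

```python
# Hypothetical, illustrative numbers only -- real Runpod pricing and
# on-prem costs vary; plug in your own figures.
HOURS_PER_YEAR = 24 * 365
YEARS = 3

# Assumed cloud side: 4x A100 at a placeholder $/GPU-hour, billed only
# for the fraction of hours the cluster is actually busy.
gpu_hourly_rate = 1.89   # hypothetical rate
utilization = 0.40       # hypothetical average utilization
cloud_cost = 4 * gpu_hourly_rate * HOURS_PER_YEAR * YEARS * utilization

# Assumed on-prem side: one-time hardware purchase plus yearly
# power, cooling, and operations.
hardware_capex = 80_000  # hypothetical 4x A100 server price
yearly_opex = 25_000     # hypothetical power/cooling/staffing per year
onprem_cost = hardware_capex + yearly_opex * YEARS

savings = onprem_cost - cloud_cost
print(f"cloud:   ${cloud_cost:,.0f}")
print(f"on-prem: ${onprem_cost:,.0f}")
print(f"savings: ${savings:,.0f}")
```

The key lever is utilization: the cloud side scales with hours actually used, while the on-prem side is paid whether the GPUs are busy or idle.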

Quantization Methods Compared: Speed vs. Accuracy in Model Deployment

Explore the trade-offs between post-training, quantization-aware training, mixed precision, and dynamic quantization. Learn how each method impacts model speed, memory, and accuracy—and which is best for your deployment needs.
Read article
AI Workloads
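All the quantization methods the article compares share the same core arithmetic: mapping floats onto a small integer range via a scale and zero point. The minimal sketch below shows that round trip for post-training affine int8 quantization; real frameworks (PyTorch, TensorRT, etc.) add per-channel scales and calibration data, so treat this as the idea only, not any library's API.

```python
# Minimal post-training (affine) int8 quantization: map a float range
# onto [-128, 127], then dequantize and measure the round-trip error.

def quantize(values, num_bits=8):
    qmin, qmax = -(2 ** (num_bits - 1)), 2 ** (num_bits - 1) - 1
    lo, hi = min(values), max(values)
    scale = (hi - lo) / (qmax - qmin)          # float step per int level
    zero_point = round(qmin - lo / scale)      # int that represents 0.0
    q = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    return [(qi - zero_point) * scale for qi in q]

weights = [0.1, -0.25, 0.9, 0.33, -0.8, 0.0]
q, scale, zp = quantize(weights)
restored = dequantize(q, scale, zp)
max_err = max(abs(w - r) for w, r in zip(weights, restored))
print("int8 values:", q)
print(f"max round-trip error: {max_err:.4f}")  # small but nonzero
```

That nonzero round-trip error is exactly the speed/accuracy trade-off the article discusses: int8 storage is 4x smaller than float32, paid for with a bounded loss of precision per weight.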

How to Fine-Tune LLMs with Axolotl on RunPod

Learn how to fine-tune large language models using Axolotl on RunPod. This guide covers LoRA, 8-bit quantization, DeepSpeed, and GPU infrastructure setup.
Read article
AI Workloads
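Part of why LoRA makes fine-tuning affordable is pure parameter counting: instead of updating a full d_out x d_in weight matrix, it trains two low-rank factors B (d_out x r) and A (r x d_in) with r much smaller than the layer width. The layer sizes below are illustrative only, not tied to any specific model from the guide:

```python
# Back-of-the-envelope LoRA savings: trainable parameters for a full
# fine-tune of one linear layer versus its rank-r LoRA adapter.

def lora_params(d_out, d_in, rank):
    full = d_out * d_in            # full fine-tune: every weight trains
    lora = rank * (d_out + d_in)   # LoRA: only B (d_out x r) and A (r x d_in)
    return full, lora

full, lora = lora_params(d_out=4096, d_in=4096, rank=8)
print(f"full fine-tune: {full:,} params")     # 16,777,216
print(f"LoRA (r=8):     {lora:,} params")     # 65,536
print(f"reduction:      {full / lora:.0f}x")  # 256x
```

Repeated across every attention and MLP layer, this is what lets a large model be fine-tuned on a single GPU, and it combines naturally with the 8-bit quantization the guide covers, since the frozen base weights can stay quantized while only the small adapters train in higher precision.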

Cost-Effective AI with Autoscaling on RunPod

Learn how RunPod autoscaling helps teams cut costs and improve performance for both training and inference. Includes best practices and real-world efficiency gains.
Read article
AI Workloads

Deploying Multimodal Models on RunPod

Multimodal models go beyond text, processing images, audio, and other data types. This guide shows how to deploy and scale them using RunPod’s infrastructure.
Read article
AI Workloads

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.