
RAG vs. Fine-Tuning: Which Strategy is Best for Customizing LLMs?

RAG and fine-tuning are two powerful strategies for adapting large language models (LLMs) to domain-specific tasks. This post compares their use cases and performance, and introduces RAFT, an integrated approach that combines the best of both methods for more accurate and adaptable AI models.
Read article
AI Workloads

Partnering with Defined AI to Bridge the Data Wealth Gap

RunPod and Defined.ai launch a pilot program to provide startups with access to high-quality training data and compute, enabling sector-specific fine-tuning and closing the data wealth gap.
Read article
Product Updates

How to Run SAM 2 on a Cloud GPU with RunPod

Segment Anything Model 2 (SAM 2) offers real-time segmentation power. This guide walks you through running it efficiently on RunPod’s cloud GPUs.
Read article
AI Workloads

Run Llama 3.1 with vLLM on RunPod Serverless

Discover how to deploy Meta's Llama 3.1 using RunPod’s new vLLM worker. This guide walks you through model setup, performance benefits, and step-by-step deployment.
Read article
AI Infrastructure

RAG vs. Fine-Tuning: Which Is Best for Your LLM?

Retrieval-Augmented Generation (RAG) and fine-tuning are powerful ways to adapt large language models. Learn the key differences, trade-offs, and when to use each.
Read article
AI Workloads

How to Run the FLUX Image Generator with ComfyUI on RunPod

Step-by-step guide to deploying FLUX with ComfyUI on RunPod. Perfect for creators looking to generate high-quality AI images with ease.
Read article
Learn AI

Run Llama 3.1 405B with Ollama on RunPod: Step-by-Step Deployment

Learn how to deploy Meta’s powerful, benchmark-crushing open-source Llama 3.1 405B model using Ollama on RunPod. This guide walks you through setup and deployment.
Read article
AI Workloads

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.
