
Use alpha_value To Blast Through Context Limits in LLaMa-2 Models
Learn how to extend the context length of LLaMa-2 models beyond their defaults using alpha_value and NTK-aware RoPE scaling—all without sacrificing coherency.

Learn how to extend the context length of LLaMa-2 models beyond their defaults using alpha_value and NTK-aware RoPE scaling—all without sacrificing coherency.

Join Runpod CEO Zhen Lu and Data Science Dojo CEO Raja Iqbal on October 11 for a live fireside chat about GPU-powered AI transformation and the future of scalable machine learning infrastructure.

This guide breaks down everything you need to know about billing on Runpod—how credits are applied, what gets charged, and how to set up automatic or manual funding.

Runpod partners with RandomSeed to power easy-to-use API access for Stable Diffusion through AUTOMATIC1111, making generative art more accessible to developers.

Runpod has partnered with Data Science Dojo to power their Large Language Model bootcamps, providing scalable GPU infrastructure to support hands-on learning in generative AI, embeddings, orchestration frameworks, and deployment.

Falcon-180B is the largest open-source LLM to date, requiring 400GB of VRAM to run unquantized. This post explores how to deploy it on Runpod with A100s, L40s, and quantized alternatives like GGUF for more accessible use.

his week’s roundup covers Alibaba’s vision-language model Qwen-VL, Meta’s new code-focused LLM Code Llama, and FACET—a benchmark for detecting bias in computer vision datasets.

