Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Block quote

Ordered list

Item 1
Item 2
Item 3

Unordered list

Item A
Item B
Item C

Text link

Bold text

Emphasis

^Superscript

_Subscript

We're officially SOC 2 Type II Compliant

You've unlocked a referral bonus! Sign up today and you'll get a random credit bonus between $5 and $500

You've unlocked a referral bonus!

Claim Your Bonus

Claim Bonus

Blog

Run Hugging Face spaces on Runpod!

Learn how to deploy any Hugging Face Space on Runpod using Docker, including an example with Kokoro TTS and Gradio.

Author

River Snow

Date

May 17, 2023

Table of contents

TOC

Share

Get started

Hugging Face Spaces are interactive demos that showcase AI models directly on the Hugging Face platform. They're great for experimenting with AI capabilities, but what if you want more computing power or need to run these models in your own environment? Or you want to use them as much as you want to without being rate limited?

Good news! Every Hugging Face Space can now be run using Docker, which means you can deploy them on platforms like Runpod to leverage powerful GPUs. In this guide, we'll walk through deploying Kokoro TTS (a Text-to-Speech model) via Gradio from Hugging Face to Runpod.

What is Gradio?

Gradio is a popular Python library that creates user-friendly interfaces for machine learning models. Many Hugging Face Spaces, including Kokoro TTS, use Gradio to provide an interactive web interface where you can test the model's capabilities through your browser. By the end of this tutorial, you'll have this same interface running on your Runpod instance.

Why Kokoro TTS?

We've chosen Kokoro TTS for this example because it's a powerful text-to-speech model that benefits from GPU acceleration. This makes it a perfect candidate to demonstrate how to move from Hugging Face Spaces to Runpod's more flexible, accessible computing environment.

Prerequisites

A Hugging Face account (to generate an access token)
A Runpod account with payment method set up

Setup

First, go to the Kokoro TTS Space on Hugging Face. In the upper right you'll see a pullout menu with three dots. Click Run Locally to get more info about the Docker image that drives the Space.

Copy this Docker command down, as it contains variables that we will need later.

In addition, you will need an access token, which you can get from your Settings page.

Log in to your Hugging Face account.
Go to https://huggingface.co/settings/tokens
Click New Token
Name your token (e.g., "Runpod Access") and select appropriate permissions
Click Generate Token and copy the generated token to a secure location

Configure your Template

Now, go to Templates in the nav bar on Runpod, and click New Template.

Set the Container image as the container image specified in the docker container. In our case this is registry.hf.space/hexgrad-kokoro-tts:latest, which we got from point 3 above
Set the "Expose HTTP Ports" to 7860, since that's the port being exposed in the Docker command above.
Enter bash -c "python app.py" as your container start command.
Open "Environment variables" in the bottom of the template, and set it to:
- key : HUGGING_FACE_HUB_TOKEN
- value : (your huggingface hub token)

Deploy a Pod with Your Template

Navigate to the Pods section in the left navigation bar
Click the Deploy button
Select a GPU type (H100 works well due to high VRAM, but you can experiment with less expensive options)
Scroll down and click Change Template
Select the template you just created ("Kokoro TTS")
Review your settings and click Deploy at the bottom

Connect to your Pod

After deploying, you'll be taken to the Pods screen
Click on your newly created pod to view details
Check the Logs tab to monitor startup progress
Wait until you see a message indicating the service is running on port 7860
Once ready, click the Connect button on your pod
Select the HTTP Service option (usually has port 7860)

Review the logs in your deployed pods under the Pods screen, and wait for a notification to appear that the image is up and running on port 7860.

And ta-da!, you should see your deployment within Runpod!

Conclusion

You've successfully deployed a Hugging Face Space on Runpod! This approach works for virtually any Hugging Face Space - just repeat these steps with the appropriate Docker image and port. Runpod gives you the flexibility to choose more powerful hardware when needed, allowing you to run more demanding models than what's possible directly on Hugging Face.

‍

Build what’s next.

The most cost-effective platform for building, training, and scaling machine learning models—ready when you are.

Get started ->

Request a demo

Run Hugging Face spaces on Runpod!

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

What is Gradio?

Why Kokoro TTS?

Prerequisites

Setup

Configure your Template

Deploy a Pod with Your Template

Connect to your Pod

Conclusion

Run Hugging Face spaces on Runpod!

What is Gradio?

Why Kokoro TTS?

Prerequisites

Setup

Configure your Template

Deploy a Pod with Your Template

Connect to your Pod

Conclusion

A1111 Serverless API – Step-by-Step Video Tutorial

Easy LLM Fine-Tuning on RunPod: Axolotl Made Simple

How to Run LTXVideo in ComfyUI on Runpod

Build what’s next.

Run Hugging Face spaces on Runpod!

Heading 1

Heading 2

Heading 3

Heading 4

Heading 5

Heading 6

What is Gradio?

Why Kokoro TTS?

Prerequisites

Setup

Configure your Template

Deploy a Pod with Your Template

Connect to your Pod

Conclusion

Run Hugging Face spaces on Runpod!

What is Gradio?

Why Kokoro TTS?

Prerequisites

Setup

Configure your Template

Deploy a Pod with Your Template

Connect to your Pod

Conclusion

Related articles.

A1111 Serverless API – Step-by-Step Video Tutorial

Easy LLM Fine-Tuning on RunPod: Axolotl Made Simple

How to Run LTXVideo in ComfyUI on Runpod

Build what’s next.

You’ve unlocked areferral bonus!

You’ve unlocked a
referral bonus!