Blog

Use DeepFloyd To Create Actual English Text Within AI!

Tired of AI gibberish in your generated images? Learn how to use DeepFloyd on Runpod to generate real English text within images, with guidance from Bill.

Use DeepFloyd To Create Actual English Text Within AI!

If you've ever tried to generate text in images in packages like Stable Diffusion, you're probably familiar with the positively haunting facsimile of language it manages to produce. It looks so much like it could be a real language, and ultimately manages to be gibberish - something like you might actually see in a dream.

Screenshot from DeepFloyd text generation tutorial

As entertaining as it can be at times, it's probably not what you want to see in a serious art project. Runpod is proud to offer a solution through DeepFloyd, as portrayed in a new video by Bill Meeks, which can be run directly within a pod on the platform.

Setting up DeepFloyd on Runpod

First, you'll need to download a customized Jupyter notebook created by Bill Meeks, which can be found here.

Then, you'll want to spin up a reasonably powerful pod (such as an A40, at the very least) and create it using the Pytorch 2 template. Even an A40 will take a few minutes per image set, so you'll want to use a healthy GPU to be sure that you don't run into CUDA errors or other delays. You'll also want to edit the template override and give the pod at least 50gb for both the volume and the container.

Once you have the Jupyter notebook loaded, upload the custom notebook that you downloaded earlier by clicking the upward facing arrow in the upper left corner.

Then, run through each cell in the notebook by pressing Ctrl+Enter, moving to the next one after they complete.

If you are prompted for a Huggingface token, you can create one under the Settings -> Access Tokens page on HF. You'll also need to go to the DeepFloyd page and accept their licensing agreement.

Running the model

To run the model, scroll down further into the notebook past the initial setup and you'll see several cells where you can go in and change the text in the image.

As with Stable Diffusion in general, the goal should be to get an image that is close to what you are looking for, and then let manual editing carry you through any curveballs that it may throw in your way. It'll likely take a few iterations to get close to where you want to be, but it can get pretty darn close on its own with some patience.

AI-generated image of a bear at a protest holding a sign reading THIS IS UBBEARABLE

Don't forget to check out Bill's very in depth Youtube guide for further insights on how to run DeepFloyd on Runpod!

Deploy When Available is now GA

Queue for any GPU spec, even one that's fully rented out, and we'll deploy it the moment capacity opens up. No more refreshing the console or running a sniping tool.

The Chips Got Faster. The Stack Didn't.

Explore why faster chips have shifted the bottleneck to AI infrastructure, and what that means for teams running production workloads.

Multi-Instance GPUs on Runpod: Stop Paying for Compute You Don't Need

With MIG, we can partition RTX 6000 Pro cards into isolated 24 GB instances. Here's when it makes sense for your workloads.

Build what’s next.

Build, train, and scale AI workloads on Runpod with cloud GPUs, Serverless, and Clusters.

Get started