
Smartphones have long given users access to AI assistants such as Siri on the iPhone. With the emergence of open-source LLMs that you can host in the cloud, you can now run a personalized AI from your iPhone using Runpod's offerings. Runpod gives you the resources to run the various (and very large) open-source LLMs, as well as to fine-tune them for customized needs.
In this tutorial, you will learn how to deploy a model on Runpod with Ollama and use the Shortcuts app on your iPhone to connect to it. That's right: you do not need to code and publish an app. When you're finished, you'll be able to open the Shortcuts app, dictate a message, and hear your new AI's reply read back to you.
The tutorial assumes you have a Runpod account with credits and a device running iOS 15 or later. No other prior knowledge is needed to complete this tutorial.
You will create a new Pod with the PyTorch template, setting a few overrides to configure Ollama:

1. Create a new Pod and select a GPU, such as an A40.
2. Choose the PyTorch template, then edit the template overrides.
3. Add 11434 to the list of exposed HTTP ports. This is the port Ollama uses for HTTP API requests.
4. Add an environment variable named OLLAMA_HOST with the value 0.0.0.0. This setting configures Ollama to listen on all network interfaces, enabling external access through the exposed port. For detailed instructions on setting environment variables, refer to the Ollama FAQ documentation.
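Taken together, the overrides amount to the following, shown here in shell form purely as an illustration; on Runpod you enter these values in the Pod template UI:

```shell
# Mirrors the Pod template overrides described above; on Runpod you
# set these in the template UI rather than in a shell.
export OLLAMA_HOST=0.0.0.0   # listen on all network interfaces
OLLAMA_PORT=11434            # Ollama's default API port; add it to the exposed HTTP ports
echo "Ollama will listen on ${OLLAMA_HOST}:${OLLAMA_PORT}"
```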
Once the Pod is up and running, you'll have access to a terminal within the Runpod interface.
Now that your Pod is running, you can log in to the web terminal. The web terminal is a powerful way to interact with your Pod.
Run the Ollama installation script in the web terminal. The script fetches Ollama and sets it up on your Pod, and the ollama serve command then starts the Ollama server, making it ready to serve AI models. Note that when the web terminal closes, the server stops with it, so once you're up to speed you may want to run it in tmux, a Jupyter notebook, or some other method that keeps the server running persistently.
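As a sketch, the install-and-serve step looks like this (the script URL is Ollama's official installer; the tmux session name is arbitrary):

```shell
# Download and run the official Ollama install script
curl -fsSL https://ollama.com/install.sh | sh

# Start the server in a detached tmux session so it survives the
# web terminal closing; "ollama" is just a session name
tmux new -d -s ollama 'ollama serve'
```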
Now that your Ollama server is running on your Pod, add a model.
To run an AI model using Ollama, pass the model name to the ollama run
command:
Replace [model name]
with the name of the AI model you wish to deploy. For a complete list of models, see the Ollama Library.
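For example, assuming you pick the llama3 model from the Ollama Library:

```shell
# Pulls the model on first use, then opens an interactive prompt;
# type /bye to leave the prompt
ollama run llama3
```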
This command pulls the model and runs it, making it accessible for inference. You can now begin interacting with the model directly from your iPhone.
In the Runpod interface, click Connect on your Pod, then click HTTP Service to get the URL (for example, https://cwjcj767dd2auh-11434.proxy.runpod.net) that your iPhone will use to connect.
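Under the hood, your Shortcut will POST to Ollama's /api/generate endpoint at that URL. A minimal sketch, assuming the example Pod ID above and the llama3 model name:

```shell
POD_ID="cwjcj767dd2auh"   # example Pod ID; yours appears in the Connect dialog
BASE_URL="https://${POD_ID}-11434.proxy.runpod.net"
echo "${BASE_URL}/api/generate"

# From any terminal, you could test the endpoint like this:
#   curl "${BASE_URL}/api/generate" \
#     -d '{"model": "llama3", "prompt": "Hello!", "stream": false}'
```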
Open the Shortcuts app on your iPhone and build a Shortcut that dictates your message, sends it to your Pod, and reads the reply back to you.
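Inside the Shortcut, a Dictate Text action can capture your message and a Get Contents of URL action can POST it to the endpoint as JSON. The field names below are Ollama's generate-request fields; llama3 is an assumed model name, and DICTATED_TEXT stands in for the Shortcut's dictated-text variable:

```shell
# Request body the "Get Contents of URL" action sends; setting
# "stream" to false returns one complete reply instead of chunks.
BODY='{"model": "llama3", "prompt": "DICTATED_TEXT", "stream": false}'
echo "$BODY"
```

Ollama's reply is a JSON object whose response field carries the model's text; a Get Dictionary Value action can extract it for a Speak Text or Show Result action.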
Now you can tap Play in the bottom right to start talking to your new AI. You can also tap the info button to add an icon to your Home Screen, or share the Shortcut with other people.
In this tutorial, you built an AI that you can talk to from your phone without writing any code, and you can share it with your friends and family. With Runpod, you have the resources to run models of all sizes.
Consider enhancing your AI by trying larger models or fine-tuning one for your own needs.