Open in app
Cover art for How a powerful AI runs on your phone without internet

How a powerful AI runs on your phone without internet

Technology · 6 min listen

Get the app on mobile
Download on the App Store Get it on Google Play
Cover art for How a powerful AI runs on your phone without internet
0:00
0:00
Transcript

HostIt's pretty wild to think about how much stuff is tucked away on the internet. All those books and websites and photos, and now these huge ai models that seem to know everything. But usually, all that brain power lives in a giant room full of humming fans and huge computers, far away from us. We just send a text or a question through the air and wait for it to come back. It feels like magic, but it really depends on having a good signal. So, how are people starting to fit those massive digital brains right onto a phone where they can work even if you're in the middle of the woods with no bars at all?

GuestIt really is a massive task. When you look at the big models everyone talks about, they're basically made of billions and billions of numbers. Think of it like a giant map where every single turn and every single street is a tiny piece of math. To run that on a big computer in a warehouse, you need tons of power and a lot of space. But your phone is small, it gets hot easily, and it runs on a battery that you want to last all day. The first thing we have to do is find a way to make that giant map much, much smaller without losing the way home. We do this with a trick that's sort of like rounding your taxes. Instead of saying you owe ten dollars and fifty-two cents, you just say ten dollars. In the ai world, we take those very long, very exact numbers and we shorten them. We call it rounding off or squeezing the math. It turns out, if you do it right, the ai can still make a very good guess even if the numbers aren't perfect.

HostBut if you're rounding everything off and making the math less exact, does that not just make the ai a lot dumber? I mean, if I'm trying to write something important or solve a problem, I don't want the lazy version of the brain.

GuestYou would think so, right? It feels like you should be losing a ton of quality. But it's a lot like looking at a photo on your phone versus a giant print on a wall. On the phone, the photo is much smaller and has less detail if you zoom way in, but to your eyes, it looks just as sharp. We found out that these models are actually full of a lot of fluff. There are millions of connections in that digital brain that don't actually do much of anything. So, along with rounding the numbers, we also just cut out the parts that aren't being used. We call it pruning, like trimming a hedge. You snip away the dead branches and the stuff that's just taking up space. By the time we're done, the model might be ten or twenty times smaller than it started, but it still acts almost exactly the same as the big one. It's still the same brain, it's just much leaner.

HostOkay, so we make the brain smaller and leaner. But even a lean brain needs a place to work. My phone already gets hot just playing a game or watching a video. How does the actual phone part keep up with all those calculations without just melting in my hand?

GuestWell, that's where the physical parts of the phone come in. For a long time, the main chip in your phone was a jack of all trades. It was good at opening apps and showing your screen and checking your mail. But it wasn't built for this specific kind of heavy lifting. Now, phone makers are putting in a special kind of chip that's built for one thing only: doing massive amounts of simple math all at once. Think of it like a huge group of people all doing easy addition at the same time instead of one person trying to solve a very hard puzzle. This special chip is built to be very fast and, more importantly, very cool. It uses much less power to do the same work. Because it's so fast, it can run through all those rounded-off numbers in the blink of an eye. That's why you can now take a photo and the phone instantly knows where the dog is or how to make the sky look bluer. It's doing all that math on that one special chip.

HostIt still feels like a lot of work just to avoid using the internet. If the big computers in the cloud are so much stronger, why not just let them do the heavy lifting? Is it just about being able to use it when you're on a plane?

GuestThat's a part of it, but the bigger reason is actually about what you want to keep to yourself. When you use an ai that lives on a big server far away, you're sending your words, your photos, and your secrets across the web to a company you might not totally trust. When the ai lives on your phone, nothing ever leaves your pocket. Your phone can read your messages to help you write a reply, or look at your private photos to help you find a specific memory, and no one else ever sees it. Not even the people who made the phone. It makes the ai feel more like a personal helper and less like a spy. Plus, there's no delay. You don't have to wait for the signal to go to a giant building in another state and come back. It just happens.

HostSo it's really about making the phone understand us better without having to tell the whole world about it.

GuestExactly, it's about giving you that power in a way that feels fast and private, even if the math under the hood is getting a little bit simplified.

HostThe idea of a giant digital map being trimmed down like a garden hedge makes those billions of numbers feel a lot more manageable.

GuestThe real trick is that we're finding out these models don't need to be giant to be smart; they just need to be built for the pocket they live in.

HostIt's a long way from needing a giant room of computers to having all that knowledge tucked away in your jeans while you walk through the woods.

Made with Wander

A world of curiosity you can listen to. Explore endless questions, or ask your own.

Get the app