Open in app
Cover art for How a phone AI chip speeds up machine learning

How a phone AI chip speeds up machine learning

Technology · 6 min listen

Get the app on mobile
Download on the App Store Get it on Google Play
Cover art for How a phone AI chip speeds up machine learning
0:00
0:00
Transcript

HostWe all have those moments where we point our phone at a strange flower in the woods and it tells us the name instantly, or we take a blurry photo in a dark room and the phone somehow makes it look clear and bright. It feels like there's a little bit of magic happening behind the glass, but it's actually down to a very specific piece of hardware tucked away inside.

HostHow does a tiny slice of metal and wire actually handle all that heavy lifting without getting too hot or draining the battery in ten minutes?

GuestIt really comes down to how the work is divided. In the old days, and even in most computers today, you have one main chip that's like a very, very smart person who's a bit of a generalist. This chip, the one we call the main brain, can do anything. It can run a spreadsheet, it can play a video, it can browse the web. But the catch is that it usually likes to do things one by one. It's incredibly fast at doing one thing, finishing it, and then moving to the next.

GuestBut when you ask a phone to recognize a face or figure out what a voice is saying, the math changes. Those tasks aren't about doing one hard thing at a time. They're about doing millions of tiny, simple things all at the same exact moment. If you try to force the main brain to do that, it gets overwhelmed. It's like asking a world-class math professor to solve ten million addition problems for first-graders. The professor could do it, but it would take forever and they would get exhausted. The AI chip is more like ten thousand first-graders all sitting in a room, each solving one tiny problem at the same time. They finish the whole batch in a second.

HostSo it's less about how smart the chip is and more about how many hands it has to work with?

GuestThat's a great way to think about it. If you look at the layout of a normal chip, it's built with a few very powerful paths for data to travel. We call these cores. You might have four or eight of them. But an AI chip, or what people sometimes call a neural engine, has hundreds or even thousands of these little work zones. They're stripped down. They can't do everything the main chip does. They can't run your whole phone or manage the screen. But they're built for one specific kind of math that we call matrix multiplication.

GuestThink of a photo as a giant grid of numbers. Every pixel has a value for color and brightness. To make a blurry photo look sharp, the phone has to look at every single one of those millions of numbers and change them based on the numbers next to them. This is where the AI chip shines. It can grab the whole grid and crunch all those numbers at once because it has all those tiny, specialized lanes open and ready to go.

HostWait, I have a hard time seeing why this is so different from what chips have always done. My phone from five years ago could still take photos and recognize my face. Was it just doing it slower, or was it using a different trick entirely?

GuestIt was doing it the hard way. It was using the main brain or the graphics chip to do the work. The problem is that because those chips weren't built for this specific math, they had to work much harder. It's the difference between driving a regular car through a field and using a tractor. The car can get across the field, but it's going to strain the engine, it might get stuck, and it's going to burn a lot of gas.

GuestWhen your phone uses a specialized AI chip, it's using a tool built exactly for that terrain. Because the chip is physically wired to do this one type of math, it doesn't have to spend energy thinking about how to do it. The paths are already laid out in the metal. This means the phone stays cool in your hand and your battery lasts much longer. It also means the phone can do things in real-time. You can point your camera at a sign in a foreign language and see the translation pop up right over the words as you move the phone. That kind of speed is only possible because the chip is doing those millions of calculations every time the frame of the video changes.

HostBut if the chip is so specialized, does that mean it's useless for anything else? Like, if I'm just texting or reading an email, is that part of the phone just sitting there doing nothing?

GuestMostly, yes. It stays quiet until it's needed. But you would be surprised how often it kicks in. Even when you're just typing, there's often a small model running to guess the next word you're going to say. That's a tiny bit of machine learning. In the past, that might have been done by the main brain, but now the phone can pass that off to the AI chip. It keeps the main brain free to make sure the app you're using feels smooth and fast.

GuestThere's also a big win for privacy here. Before we had these chips, a lot of the heavy lifting had to happen on a giant server in a building miles away. You would send your voice or your photo up to the cloud, a massive computer would do the math, and send the answer back. That takes time, and it means your data is leaving your device. With these new chips, the heavy math stays in your pocket. The phone is powerful enough to learn your voice or recognize your friends without ever telling the rest of the internet about it.

HostIt sounds like we have basically moved a tiny version of those giant server farms directly into our pockets.

GuestWe really have. We're packing more and more of these tiny work zones into that sliver of silicon every year. The real shift is that we have stopped trying to make the main brain faster and faster. Instead, we're giving it specialized helpers. It's a change in the way we think about power. It's no longer about one big, loud engine. It's about a whole team of tiny, quiet tools that know exactly when to step in.

HostThose thousands of tiny workers are the reason that blurry photo of the dog finally looks like a masterpiece.

GuestThe chip is the only reason your phone can see the world as it happens instead of just snapping a static picture and thinking about it later.

HostThe plant in the park finally has a name because that little slice of metal is doing the work we used to leave to giant buildings full of computers.

Made with Wander

A world of curiosity you can listen to. Explore endless questions, or ask your own.

Get the app