Entry № 041-8 / V-77 · 0:00 synced

The NEW Chip Inside Your Phone! (NPUs)

Techquickie@techquickie321.8K viewsApr 16, 20245:30
Source
YT
Views
321.8K
Subscribers
4.3M
Critic
?
Audience
?

0 up · 0 down · 0 ratings

Description

Check out the MSI MAG 1250GL PCIE5 at lmg.gg Thanks to Dr. Ian Cutress for his help with this video!

Promos

Check out his blog and YouTube channel: morethanmoore.substack.com youtube.com Neural processing units (NPUs) such as Apple's Neural Engine or the machine learning engine on Google Tensor chips can be found on the iPhone and the Pixel. How do they help run AI right on your phone? Leave a reply with your requests for future episodes. ► GET MERCH: lttstore.com ► GET EXCLUSIVE CONTENT ON FLOATPLANE: lmg.gg ► SPONSORS, AFFILIATES, AND PARTNERS: lmg.gg FOLLOW US ELSEWHERE --------------------------------------------------- Twitter: twitter.com Facebook: @LinusTech Instagram: @linustech TikTok: @linustech Twitch: twitch.tv

Start
AI OverviewDefault language

The video explains how neural processing units (NPUs) inside modern smartphones enable on-device artificial intelligence while balancing power, heat, and latency. It starts by contrasting NPUs with traditional CPU cores, explaining that NPUs are highly optimized for AI tasks and are designed to be embarrassingly parallel with limited die area so they can perform machine learning tasks without draining the battery or generating excessive heat. The discussion then moves to why devices still benefit from local AI despite powerful cloud servers: on-device inference reduces latency, preserves user privacy by keeping data on the device, and avoids round trips to the cloud for common tasks like voice recognition and image correction. The host also clarifies that not all AI tasks are suitable for on-device processing yet, noting that very large generative models still require cloud resources, while simpler features like live translation can run locally. Finally, the video describes the ongoing industry exploration to determine the sweet spot between on-device and cloud AI, highlighting how hardware makers, software developers, and ecosystem partners collaborate to bring more AI features to devices while watching for monetization paths and practical use cases. The episode closes by pointing to broader trends in AI hardware on both phones and PCs, and by inviting viewers to share future topic requests and continue watching for more comparisons of AI on-device versus in the cloud.

Topics · technology · mobile_devices · artificial_intelligence · computer_hardware · cloud_computing · privacy

Questions answered

What is an NPU and why is it useful in phones?
An NPU is a neural processing unit optimized for AI tasks, allowing on-device inference with lower latency and better privacy than cloud-based processing.
Why not run all AI on the cloud if it’s more powerful?
Cloud AI can be powerful, but on-device AI reduces latency, preserves privacy by keeping data local, and avoids constant data transfer to servers.
Which tasks can currently run on-device and which require cloud?
Lighter tasks like live translation and basic voice or image processing can run on-device, while more demanding generative AI models still rely on cloud resources.