This seems like a step in the right direction. Much like language translation, doing it on-device is the only way to preserve people's data agency / privacy.
You probably underestimate the amount of effort Apple puts into not doing this, to maintain user privacy, and for a good while their services have suffered for it.
As an example I'd highlight the year in review feature between Apple Music and Spotify. "Replay" is significantly worse than "Wrapped" and I believe the difference is data handling is the key differentiator.
However, there are some advances in balancing privacy 2ith utility, as highlighted in this post from Apple ML research: https://machinelearning.apple.com/research/scenes-differential-privacy
I think this is the way to go. Rather than paying every random app a subscription to jerry-rig AI into their programming somehow, I’d like to have my own personal, and private, AI that plugs into a framework that each app offers. I’d also like to be able to purchase curated extensions to privately enhance my own AI.
You're talking about two very different technologies though, but both are confusingly called "AI" by overzealous marketing departments. The basic language recognition and regressive model algorithms they ship today are "Machine Learning", and fairly simple machine learning at that. This is generally the kind of thing we're running on simple CPUs in realtime, so long as the model is optimized and pre-trained. What we're talking about here is a Large Language Model, a form of neural network, the kind of thing that generally brings datacenter GPUs to their knees and generally has hundreds of parameters being processed by tens of thousands of worker neurons in hundreds of sequential layers.
It sounds like they've managed to simplify the network's complexity and have done some tricks with caching while still keeping fair performance and accuracy. Not earth shaking, but a good trick.
Wouldn't this absolutely hammer the battery though, or at least give the CPU a hard time? My understanding is that offloading the work to a cloud platform means that the processor-intensive inputting, parsing, generating, and outputting operations are done in purpose-built datacentres, and end user devices just receive the prepared answer.
Wouldn't this rinse the battery and increase the overall device temperature for "normal" end users?
Fair warning: I haven't read the two papers outlined in the article.
CPUs can have special hardware accelerators for stuff like this, and you'd be surprised how powerful our little phone CPUs are and how optimized stuff like this can become.
I'm showing my age here, but much like we had math coprocessors running beside the 286 and 386 gen CPUs to take on floating point operations; then graphics cards offloaded geometry-based math operations to GPU's - are we looking at AI-style die or chips to specifically work on AI functions?
Excuse my oversimplification, this isn't my field of expertise!
Yup, technology and especially phones have come a disgustingly long way in such a short amount of time. Running AI efficiently on them is the next step, one that we probably won't struggle with too much.
Apple already does a lot of this stuff. For example, it'll do offline face recognition for your photos while your phone is charging overnight.
Plus, Apple is ahead of the curve when it comes to performance on this stuff. You don't want to be running Stable Diffusion on your iPhone, but smaller AI is perfectly fine. Plus, unlike on Android, there are huge amounts of devices with ML accelerator chips that can run these models efficiently, allowing for power consumption optimisations by not having to provide a CPU fallback.
We'll have to see how effective this will be in practice, but Apple generally doesn't bring these types of features to their newer devices until they're ready for daily use.
🤖 I'm a bot that provides automatic summaries for articles:
Click here to see the summary
Apple’s latest research about running large language models on smartphones offers the clearest signal yet that the iPhone maker plans to catch up with its Silicon Valley rivals in generative artificial intelligence.
The paper was published on December 12 but caught wider attention after Hugging Face, a popular site for AI researchers to showcase their work, highlighted it late on Wednesday.
Device manufacturers and chipmakers are hoping that new AI features will help revive the smartphone market, which has had its worst year in a decade, with shipments falling an estimated 5 percent, according to Counterpoint Research.
Running the kind of large AI model that powers ChatGPT or Google’s Bard on a personal device brings formidable technical challenges, because smartphones lack the huge computing resources and energy available in a data center.
Apple tested its approach on models including Falcon 7B, a smaller version of an open source LLM originally developed by the Technology Innovation Institute in Abu Dhabi.
Academic papers are not a direct indicator of how Apple intends to add new features to its products, but they offer a rare glimpse into its secretive research labs and the company’s latest technical breakthroughs.