Apple's On-Device and Server Foundation Models
941 points
by 2bit
5 months ago
| 38 comments
| machinelearning.apple.com
| HN
rishabhjain1198
5 months ago
[-]
For people interested in AI research, there's nothing new here.

IMO they should do a better job of referencing existing papers and techniques. The way they wrote about "adaptors" can make it seem like it's something novel, but it's actually just re-iterating vanilla LoRA. It was enough to convince one of the top-voted HackerNews comments that this was a "huge development".

Benchmarks are nice though.

reply
lolinder
5 months ago
[-]
> For people interested in AI research, there's nothing new here.

Was anyone expecting anything new?

Apple has never been big on living at the cutting edge of technology exploring spaces that no one has explored before—from laptops to the iPhone to iPads to watches, every success they've had has come from taking tech that was already prototyped by many other companies and smoothing out the usability kinks to get it ready for the mainstream. Why would deep learning be different?

reply
csvm
5 months ago
[-]
Prototyping tech is one thing; making it a widely adopted success is another. For instance, Apple was the first to bring WiFi to laptops in 1999. Everyone laughed at them at the time. Who needs a wireless network when you can have a physical LAN, ey?
reply
steve1977
5 months ago
[-]
> For instance, Apple was the first to bring WiFi to laptops in 1999. Everyone laughed at them at the time. Who needs a wireless network when you can have a physical LAN, ey?

From https://en.wikipedia.org/wiki/AirPort:

"AirPort 802.11b card"

"The original model, known as simply AirPort card, was a re-branded Lucent WaveLAN/Orinoco Gold PC card, in a modified housing that lacked the integrated antenna."

reply
gumby
5 months ago
[-]
That was also how lucent’s access points worked.
reply
collinmanderson
5 months ago
[-]
It was pretty neat how you could take the PCMCIA card out of an AirPort or WaveLAN/Orinoco and stick it in a laptop and it would just work.
reply
Matl
5 months ago
[-]
On the other hand, people who laughed at them removing the 3.5mm jack can still safely laugh away.
reply
throw0101d
5 months ago
[-]
> On the other hand, people who laughed at them removing the 3.5mm jack can still safely laugh away.

Then laugh at Samsung and their flagship line of phones as well, since they haven't had headphone jacks for a while now. "After Note 10 dumps headphone jack, Samsung ads mocking iPhone dongles disappear" (2019):

* https://www.cnet.com/tech/mobile/after-removing-headphone-ja...

"Samsung is hiding its ads that made fun of Apple's removal of headphone jack":

* https://www.androidauthority.com/samsung-headphone-jack-ads-...

reply
tivert
5 months ago
[-]
>> On the other hand, people who laughed at them removing the 3.5mm jack can still safely laugh away.

> Then laugh at Samsung and their flagship line of phones as well, since they haven't had headphone jacks for a while now. "After Note 10 dumps headphone jack, Samsung ads mocking iPhone dongles disappear" (2019):

I totally do. One of the problems with Apple is the industry seems to mindlessly ape their good and bad decisions. Their marketing has been so good, many people just assume whatever they do must be the best way.

reply
barbecue_sauce
5 months ago
[-]
At the time I felt like Apple was getting rid of the 3.5mm jack as a potential bottleneck for future iPhone designs (as one of the limiting aspects of form factor), but there still doesn't seem to be anything design-wise to justify it, even several years later. It is very clear now that it was merely to encourage Air Pod adoption.
reply
user_7832
5 months ago
[-]
I would say this was obvious to the cynical of us from the very beginning. Unless you are trying to go portless (for water resistance perhaps?) or have a very thin phone, there’s very little benefits of removing the jack… except to drive airpod sales, of course.
reply
arvinsim
5 months ago
[-]
I draw the line on going pure portless. I would like to retain USB-C thank you very much.
reply
WanderPanda
5 months ago
[-]
I mean to go thinner than 6/6s I can see the 3.5mm causing trouble. Part of me is still sad they bounced they went the other direction when it comes to iPhone thickness
reply
monooso
5 months ago
[-]
> One of the problems with Apple is the industry seems to mindlessly ape their good and bad decisions...

That's not a problem with Apple.

reply
talldayo
5 months ago
[-]
It's more of a regulatory problem, under a certain light.
reply
josephg
5 months ago
[-]
Regulation that stops companies copying ideas from one another would be a disaster.
reply
talldayo
5 months ago
[-]
I'm more suggesting that bad decisions should be litigated against fast-and-early, so other companies aren't encouraged to follow in Apple's footsteps. If every company had their own Lightning connector, there would be no choice but to force them all to converge. The original sin is letting it happen at all, in the first place.
reply
josephg
5 months ago
[-]
Who decides which ideas are good and bad? I assume you wouldn’t want regulators to have retroactively forced Apple to keep floppy disk drives in their computers. Or cdrom drives? It’s just the “obviously bad” ideas that should be banned, right?

Do you have a crystal ball that lets you know ahead of time which choices are good and bad? Even in retrospect I’m not sure Apple made the wrong choice with the lightning connector. It’s a better connector in just about every way than micro-usb, which was the only standard alternative at the time. Apple’s experience with lightning was rolled into the design process for usb-c, which as I understand it they were heavily involved in. USB-c might not exist as it does today without Apple’s experiments with the lightning connector.

Even if we pretend you’re better at picking winners and losers in the tech world than Apple and Samsung, do you think regulators are going to be as canny as you are with this stuff? US politicians don’t seem to understand the difference between Facebook and the internet. Are you sure you want them making tech choices on your behalf?

If you ask me, I think regulators would make a dogs breakfast of it all. If they were involved we’d probably still have laws on the books mandating that all laptops still have parallel ports or PCMCIA slots or something. The free market can sure take its time figuring this stuff out. But competition does, usually, lead to progress.

reply
throw0101d
5 months ago
[-]
> If every company had their own Lightning connector, there would be no choice but to force them all to converge.

You mean like forcing every phone maker to use the awesome connecter known as micro USB-B?

* https://en.wikipedia.org/wiki/Common_external_power_supply

reply
monooso
5 months ago
[-]
I don't really see how you could litigate against dropping the headphone jack from the iPhone, which is the context of this thread.
reply
deanishe
5 months ago
[-]
> One of the problems with Apple is the industry seems to mindlessly ape their good and bad decisions.

That's hardly Apple's fault, but it is so annoying. Hardly any of my cables have proper reinforcement sleeves any more.

Apple started making awful cables that snapped at the plugs, and everybody else just copied them.

reply
skeaker
5 months ago
[-]
We absolutely do laugh at both already.
reply
krick
5 months ago
[-]
I would gladly laugh, but it's nearly impossible to buy a good phone now. TBH I don't care that much about my phone not having a 3.5mm (even if I would need to use wired headphones, which is very rare now, I can use an USB adapter for that), but there are basically no phones without this stupid hole in the display, or with a good dedicated (not under screen) fingerprint scanner (because who needs that, when you can have face recognition, right?) All top-line phones are like $1500 now, but still are considered like disposable products that are naturally expected to be changed every 2 years. Batteries are not removable, yet devices are not actually (reliably) waterproof.

And maybe I'm wrong, but somehow it feels each improvement like that was actually pioneered by Apple. In the dreamworld of free-market enthusiasts this should have made Apple bankrupt or iPhone a very niche consumer device, but in the real world everything just became iPhone. There are some rare exceptions, but these are either outright experimental and gimmicky (because being different is their identity), or just bottom-of-the-line products that have these "intentional defects" that should make you chose the more expensive option.

reply
lebed2045
5 months ago
[-]
i seems you exclude many android options, that have hidden selfie camera and good working fingerprint scanner. ZTE Axon 20 5G as an example.
reply
joshstrange
5 months ago
[-]
This is such a tired talking point. Use a (lightning|USB-C)->3.5mm adapter or use bluetooth.
reply
astrange
5 months ago
[-]
And your experience for phone use cases will be better, because walking with wired headphones in gives you nasty telephonic effects (sound transmission along the cable) and they get tangled up.
reply
bigstrat2003
5 months ago
[-]
The two aren't mutually exclusive. Those who wish to use Bluetooth headphones can happily use them, while those who prefer wired can continue to use them. There's no reason smartphone manufacturers shouldn't support both.
reply
astrange
5 months ago
[-]
They do, you can get the little dongle and it has superior audio quality to most audiophile DACs on the market.

But that 3.5mm port takes up a lot of room that could be used for more battery, backup antennas for when the user's hand is covering one of them, vibration motors etc.

reply
lebed2045
5 months ago
[-]
I'm pretty sure you could find justification even for why top models don't support microSD card expansion. The problem with this line of argument is that if they wanted to, they could support both without any issues. The real reason is money. It's more profitable to have Bluetooth only when you also make AirPods, and not include storage expansion when you sell built-in memory options at a 400% markup.
reply
jb1991
5 months ago
[-]
Interesting that you suggest laughing at their decision to remove the headphone jack, when it was actually just the first of an industry-wide shift that has done so by other companies.
reply
bildung
5 months ago
[-]
Was that really the case? I remember they were mocked for e.g. offering wifi only, firewire only etc., while the respective removed alternatives were way more common.
reply
blihp
5 months ago
[-]
In the consumer space at least, WiFi was nowhere to be seen on a typical PC when Apple adopted it. Same with USB. So while it technically originated and existed elsewhere, there was no serious traction on it prior to Apple adoption.

What you say is also true: many people weren't ready to ditch the old when Apple decide to deprecate it.

reply
josephg
5 months ago
[-]
This has been true for ages. They were the first to ditch the floppy disk drive and later cd drive in their computers. Both choices were very controversial at the time.
reply
paulmd
5 months ago
[-]
They were also the first to usb-c nirvana - they were shipping laptops with thunderbolt in 2011, and moved to usb-c in 2016, giving you four full-capability outputs at a time when most laptops had one, sometimes, and it took another 5 years before most laptops adopted at least one as standard and premium laptops sometimes had two.

(People look down on the move to usb-c which I don’t quite get, everyone seems to fawn over usb-c in other contexts but macbooks, amirite!?!?. Yes it’s nice to have a hdmi port but fundamentally if you buy into this vision that usb-c does everything but you also want to use a bunch of legacy ports (vs thunderbolt video, thunderbolt networking, etc) then obviously you’re going to have to have dongles, and people supposedly buy into that vision in other contexts. Apples implementation of that vision was fundamentally at least a decade ahead of the curve there, if you’re going to do that you want lots of ports and you want every port to do everything, not “this one is the only one that can charge fast”, “that one doesn’t have video output”, “if you run both ports they drop to some weird lower capability because you’re dividing the controller”. Those complaints are the things people don’t like about the base-tier M-series processors today, and apple’s previous models solved that problem long before anyone else did.)

Hell until very very recently a lot of the time the competition didn’t even have thunderbolt/Pcie tunnelling… you got 10gbps usb-c and a grab-bag of charging and display features, and you’re gonna like it. That’s still the case with motherboards and it’s literally only with this years’ release that we’re finally getting usb4/thunderbolt as standard on high-end boards. Literally more than a decade from when apple started putting thunderbolt on laptops, almost a decade from the era of 4x tb3 full-spec ports.

… and reminder that in classic usb fashion, usb4 still doesn’t even guarantee Pcie tunneling support. So really it can still be just a normal usb-c 10gbps connection in a silly mustache and trench coat. Even on the next-gen stuff. What’s the term for doing an ok, moderately competent but not even exceptionally good job while your competition repeatedly shoots themselves in the face, again? But it’s by design - the intent is deceiving and manipulating the customer into buying last year’s junk, it’s working as intended for USB-IF’s real customers and stakeholders.

https://en.wikipedia.org/wiki/USB4#cite_ref-auto2_15-0

reply
talldayo
5 months ago
[-]
> They were also the first to usb-c nirvana

Intel says hi.

> People look down on the move to usb-c which I don’t quite get

People loved Thunderbolt for replacing Firewire. They hated Apple's choice because these USB-C Macbooks shipped with precisely zero USB-A ports and relegated every user to carrying around a dongle.

The year is 2024, we're almost a decade out from Apple going all-in on USB-C and the predominant peripheral connector is still type-A. I don't like it either, but plugging our ears and pretending like it's not a problem is silly and only makes consumers mad.

> and reminder that in classic usb fashion, usb4 still doesn’t even guarantee Pcie tunneling support.

That is in fact the correct default to use. Ever heard of Thunderspy? https://thunderspy.io/

reply
josephg
5 months ago
[-]
> Intel says hi.

Intel helped make this move possible, but it doesn’t manufacture laptops. Apple took the heat for “donglegate”.

On the x86 desktop, usb-c is still surprisingly rare. I think my motherboard (that’s less than a year old) only has 2 usb-c ports and 8x usb-a.

reply
talldayo
5 months ago
[-]
> Intel helped make this move possible, but it doesn’t manufacture laptops. Apple took the heat for “donglegate”.

And rightfully so. They took Intel's technology and told an unprepared and uninterested industry to switch or die. Naturally, very few manufacturers switched over and Apple's all-or-nothing strategy made more people mad than happy.

Having 4 lanes of Thunderbolt connectivity is awesome. It doesn't really fix the fact that none of them can easily connect to a wired keyboard or mouse.

> On the x86 desktop, usb-c is still surprisingly rare.

My motherboard only has one TB connector, everything else is type-A too. Most of the bandwidth is broken out over SATA or PCIe internally, and frankly I don't regret it one bit. 99% of my life, there is nothing plugged into that Thunderbolt port.

reply
IOT_Apprentice
5 months ago
[-]
Apple was first with 64 bit iPhone chips. Remember Qualcomm VP at the time claimed it was nothing. Apple Silicon for M1 was impressive for instant in low power high performance.
reply
lolinder
5 months ago
[-]
Those are both still (major) incremental improvements to known tech, not cutting-edge research. Apple takes what other companies have already done and does it better.
reply
sunshinerag
5 months ago
[-]
all cutting-edge research other companies are supposedly doing are also incremental. Depends on your vantage point.
reply
prmoustache
5 months ago
[-]
But last at bringing a calculator on the iPad =)
reply
viscanti
5 months ago
[-]
No one has ever brought a native (not 3rd party) calculator to the iPad before. Apple is the first.
reply
froggit
5 months ago
[-]
> No one has ever brought a native (not 3rd party) calculator to the iPad before. Apple is the first.

I'm not sure I understand... Considering apple makes the ipad, wouldn't all ipad calculators other than apple's be 3rd party by definition?

reply
jeanlucas
5 months ago
[-]
> For people interested in AI research

I think he is pointing out for people interested in research.

OTOH, it is interesting to see how a company is applying AI to customers at the end. It will bring up new challenges that will be interesting from at least an engineering point of view.

reply
rmbyrro
5 months ago
[-]
I think you misinterpreted OP's comment. Apple makes it sound like there's smth new, but there isn't. They don't have to innovate, but it's good practice to credit who've done what they're taking and using. Also to use the names everyone else is already using.
reply
marci
5 months ago
[-]
The strange thing is Apple did mention (twice) in the article that their adapters are loras so I don't understand OP's comment.
reply
aceazzameen
5 months ago
[-]
I gathered from OP's "huge development" comment he was talking about others people's popular perception that it wasn't a lora.
reply
steve1977
5 months ago
[-]
> Also to use the names everyone else is already using.

That would be a very un-Apple thing to do. They really like to use their own marketing terms for technologies. It's not ARM, it's Apple Silicon. It wasn't Wi-Fi, it was AirPort. etc. etc.

reply
gumby
5 months ago
[-]
> It wasn't Wi-Fi, it was AirPort. etc. etc.

FWIW the term “airport” predated the name “wifi” — in those days you had to otherwise call it IEEE 802.11.

And the name as great: people were buying them like crazy and hiding them in the drop ceiling to get around the corporate IT department. A nice echo of how analysts would buy their own apple II + visicalc to…get around corporate IT.

I’m OK with Apple using “apple silicon” as the ARM is only part of it.

Just commenting on your two examples; in general I agree with your point.

reply
steve1977
5 months ago
[-]
As far as I know, both the AirPort trademark and the term Wi-Fi got introduced in 1999 (could be that AirPort was a couple of weeks earlier)
reply
gumby
5 months ago
[-]
Airport came out in the beginning of the year, maybe January, at Macworld. WECA renamed themselves the WiFi Alliance a few years later, but the name (and trademark) had to exist for it to be worth them doing so.

WECA reminds me of other "memorable" names of the era, my favorite being PCMCIA, though VESA is another fave.

reply
steve1977
5 months ago
[-]
Thanks for the info. That was pretty much around the time when I started out in IT. And yes PCMCIA, what a mouthful! I remember Ethernet cards in this format.
reply
chuckadams
5 months ago
[-]
People Can’t Memorize Computer Industry Acronyms
reply
astrange
5 months ago
[-]
> It wasn't Wi-Fi, it was AirPort.

Except in Japan, where it's AirMac. And China, where it's WWAN not Wi-Fi.

reply
Tijdreiziger
5 months ago
[-]
See also: FireWire, iSight, Retina, FaceTime, etc.
reply
KerrAvon
5 months ago
[-]
None of these really fit the pattern. Apple invented FireWire, called it FireWire, and other companies chose to call it different things in their implementations (partly because Apple originally charged for licensing the name, IIRC). iSight is an Apple product. FaceTime is an Apple product. Retina is branding for high-resolution displays beyond a certain visual density.
reply
steve1977
5 months ago
[-]
"Apple invented FireWire" is maybe not fully accurate (but actually a good example of the point here).

Wikipedia: FireWire is Apple's name for the IEEE 1394 High Speed Serial Bus. Its development was initiated by Apple[1] in 1986,[3] and developed by the IEEE P1394 Working Group, largely driven by contributions from Sony (102 patents), Apple (58 patents), Panasonic (46 patents), and Philips (43 patents), in addition to contributions made by engineers from LG Electronics, Toshiba, Hitachi, Canon,[4] INMOS/SGS Thomson (now STMicroelectronics),[5] and Texas Instruments.

What might be interesting in this regard is that Sony was also using its own trademark for it: "i.LINK".

reply
caseyy
5 months ago
[-]
> Apple has never been big on living at the cutting edge of technology

There was such a time. Same as with Google. Interestingly, around 2015-2016 both companies significantly shifted to iterative products from big innovations. It's more visible with Google than Apple, but here's both.

Apple:

- Final Cut Pro

- 1998: iMac

- 1999: iBook G3 (father of all MacBooks)

- 2000: Power Mac G4 Cube (the early grandparent of the Mac Mini form factor), Mac OS X

- 2001: iPod, iTunes

- 2002: Xserve (rackable servers)

- 2003: Iterative products only

- 2004: iWork Suite, Garage Band

- 2005: iPod Nano, Mac mini

- 2006: Intel Macs, Boot Camp

- 2007: iPhone and Apple TV

- 2008: MacBook Air, iPhone 3G

- 2009: iPhone 3Gs, all-in-one iMac

- 2010: iPad, iPhone 4

- 2011: Final Cut Pro X

- 2012: Retina displays, iBooks Author

- 2013: iWork for iCloud

- 2014: Swift

- 2015: Apple Watch, Apple Music

- 2016: Iterative products only

- 2017: Iterative products mainly, plus ARKit

- 2018: Iterative products only

- 2019: Apple TV +, Apple Arcade

- 2020: M1

- 2021: Iterative products only

- 2022: Iterative products only

- 2023: Apple Vision Pro

Google:

- 1998: Google Search

- 2000: AdWords (this is where it all started going wrong, lol)

- 2001: Google Images Search

- 2002: Google News

- 2003: Google AdSense

- 2004: Gmail, Google Books, Google Scholar

- 2005: Google Maps, Google Earth, Google Talk, Google Reader

- 2006: Google Calendar, Google Docs, Google Sheets, YouTube bought this year

- 2007: Street View, G Suite

- 2008: Google Chrome, Android 1.0

- 2009: Google Voice, Google Wave (early Docs if I recall correctly)

- 2010: Google Nexus One, Google TV

- 2012: Google Drive

- 2013: Chromecast

- 2014: Android Wear, Android Auto, Google Cardboard, Nexus 6, Google Fit

- 2015: Google Photos

- 2016: Google Assistant, Google Home

- 2017: Mainly iterative products only, Google Lens announced but it never rolled out really

- 2018: Iterative products only

- 2019: Iterative products only

- 2020: Iterative products only, and some rebrands (Talk->Chat, etc)

- 2021: Iterative products only, and Tensor Chip

- 2022: Iterative products only

- 2023: Iterative products only, and Bard (half-baked).

reply
vile_wretch
5 months ago
[-]
Some of your choices and what you consider iterative/innovative are strange to me. For 2009, a chassis update for the iMac and a spec/camera bump for the iPhone doesn't seem particularly innovative especially in comparison to say the HomePod in 2017 or Satellite SOS in 2022.

Also small correction but iTunes (as Soundjam MP) was originally third-party software and Final Cut was acquired by Apple.

reply
caseyy
5 months ago
[-]
Yes, it's not perfect.

About iTunes: I did not know that! Thank you.

About iterative/innovative: I considered hardware and software that became household names or general knowledge to be significant innovations. It is not rigorous, I tried to include more rather than less. Still, on some years these companies mostly did version increases for their hardware and software, like new iOS and macOS versions, and that was it. Those years I marked as iterative.

I included a few too many iPhones, although when I wrote that, my thought process was that these phones were pivotal to how iPhones developed. I should have included the original iPhone, and iPhone 3G — the first iPhone developed around the concept of an app platform and with an App Store. This has undoubtedly been a big innovation. iPhone 4 and 3Gs, perhaps, should not have been included.

It's loose and just to illustrate a general trend, individual items are less important, we could all pick slightly different ones. But I believe the trend would remain.

reply
KerrAvon
5 months ago
[-]
You're missing Apple Silicon, which has had a huge impact across the entire industry even if random soccer dad doesn't know about it -- if any one thing is responsible for Intel's marketshare collapsing in the future, the M series of processors is it.
reply
AshamedCaptain
5 months ago
[-]
> if any one thing is responsible for Intel's marketshare collapsing in the future, the M series of processors is it.

If anything, that would be AMD. But I'd guess they're both more worried about the entire desktop & laptop market shrinking way more than anything Apple does.

reply
barbecue_sauce
5 months ago
[-]
No Newton?
reply
caseyy
5 months ago
[-]
Missed it, but should have been included for 1998. A very good example.
reply
lolinder
5 months ago
[-]
The iMac refined a form factor that dated back to at least Commodore. The iBook came after decades of iteration by other companies on laptops. The Cube was just a PC with a more compressed form factor. The iPod came a few years after other commercial digital media players. Etc, etc.

Note that I'm not saying that there's anything wrong with their approach or that they didn't make real improvements. I'm just saying that Apple has never produced any successful product that would count as "new" to someone interested in cutting-edge research. They've always looked around at things that exist but aren't yet huge successes and given them the final push to the mainstream.

reply
caseyy
5 months ago
[-]
It depends on the definition of "new". With some definitions, we may claim that nothing is ever new — we may say computers started with the Antikythera mechanism and abaci, or maybe before. With other definitions (like "new" as in a "new for most people") we will see that Apple has brought about many new things. So we need to agree on the definition.

I used the definition of new somewhere between "new for most people", "newly popular", and "meaningfully advanced from the previous iteration". With such a definition, I think you can agree with me.

reply
KerrAvon
5 months ago
[-]
In the consumer space, I'm not sure I can think of any examples from anyone ever that are examples of cutting-edge research at the time. It's hard to build consumer products on the bleeding edge. You'd be releasing phones today using CHERI, for example, which is not quite ready for prime time.
reply
com2kid
5 months ago
[-]
> - 2000: Power Mac G4 Cube (the early grandparent of the Mac Mini form factor), Mac OS X

The Next Cube being a very obvious inspiration here.

Mac OS X was a needed step, and entire books have been written about its creation. While it has some innovative pieces, it was very much a do or die situation for Apple, not brought on by innovation so much as the need to survive. (I'm sure BeOS fans will argue that BeOS was the real innovative OS. ;) )

> - 2001: iPod, iTunes

iPod was a, very well done, refinement of the existing MP3 device category. iTunes' innovation was the licensing deal they got with record companies, that is what really surprised everyone.

> - 2002: Xserve (rackable servers)

Not sure how this is an innovation? Rack mount servers had been around a long time.

> - 2004: iWork Suite,

Microsoft literally had a product called Microsoft Works that was originally released in 1988 and came shipped on tons of home PCs.

> - 2005: iPod Nano, Mac mini

The iPod Nano was cool, the Mac mini was a wonderful feat of engineering and cost reductions.

> - 2006: Intel Macs, Boot Camp

Necessity brought this about.

> - 2007: iPhone and Apple TV

This is a perfect example of Apple entering an existing product category and doing an amazing job of execution. Palm, Blackberry, and Microsoft were already releasing very capable smart phones, but none of them bothered polishing the product (MS and Blackberry focused on corporate sales, end user experience was not the top priority) and while Apple did push a lot of technology forward to make the iPhone (notably screen tech and using capacitive touch screens), their main innovation was realizing they could get customers to pay for a cell phone. For those who don't remember, prior to the iPhone, most customers got their cellphone for "free" from their cellular provider in return for agreeing to a 2 year contract. Apple realized if they made a really nice product, that people would buy it.

Apple also did some really cool, and now largely forgotten about, positioning here involving the iPod Touch, where the iPod Touch had access to the full App Store and became entry level "kids toy" devices that got people into the ecosystem.

Heck arguably the App Store was a larger innovation than the phone.

Fun fact: Microsoft had an App Store ready to launch for Windows Mobile (pre Windows Phone 7) but it was scrapped at the last minute because an exec thought that "no way would phone users ever pay for apps".

(When I joined MS in 2006 the source code was still laying around in the Windows Mobile source tree!)

Apple TV was arguably too early at this point in time, I'd say it didn't really take off until later generations when more streaming media was available.

But innovative? Web TV was out in the late 90s (!!) and Microsoft tried to do Media Center PC's since 2002. Heck for awhile with Xbox 360, Microsoft basically owned the "TV smart device" market segment. (and they released the Xbox One as a media streaming device and sort of forgot that it was also a games console... oops)

As with most products, Apple just did a really good job of it, but Roku has dramatically outplayed everyone else in the market by getting embedded directly into cheap TVs sold at Costco.

> - 2010: iPad, iPhone 4

iPad is/was an amazing product, and it succeeded thanks to great apps.

It was also a refinement of Tablet PCs which have been around since the late 80s/early 90s.

Apple was willing to do what Microsoft wasn't, break all back-compat and make a really good single purpose device. Microsoft's tablets (some of which are really damn nice!) were always hamstrung because Microsoft never could go all in on abandoning existing x86 software. (The closest attempt being Windows 8 RT, which managed to make the perfect set of compromises to anger everyone!)

> - 2015: Apple Watch, Apple Music

The first generation Apple Watch was... meh. Now, I say this as someone who was working on a direct competitor - I am still not sure how it has such a miserable battery life and why such a massively overpowered CPU and GPU still dropped frames.

I am not sure what is innovative about Apple Music, vs every other streaming music service.

reply
throwaway4good
5 months ago
[-]
I thought the news of them using Apple Silicon rather than NVIDIA in their data centers was significant.

Perhaps there is still hope of a relaunch of xserve; with the widespread use of Apple computers amongst developers Apple has a real chance of challenging NVIDIA's CUDA moat.

reply
pjmlp
5 months ago
[-]
Not at Apple's price points.
reply
throwaway4good
5 months ago
[-]
I think NVIDIA has the highest hardware markup at the moment.
reply
Hugsun
5 months ago
[-]
You get considerably more ML FLOPS per dollar in a 4090 than any mac. It seems like the base M2 MAX is at roughly the same price point. It does grant you more RAM.

Quadro and Tesla cards might be a different story. I would still like to see concrete FLOPS/$ numbers.

reply
hajile
5 months ago
[-]
They don't need the entire mac. Their cost per Max chip is probably $200-300 which beats the 4090 by a massive margin and each chip can do more than a 4090 because it also has a CPU onboard.

4090 peaks out at around 550w which means they can run 5+ of their Max chips in the same power budget.

A 4090 is $2000. Apple can probably get 5 chips on a custom motherboard for that cost. They'll use the same amount of power, but get a lot more raw compute.

reply
Hugsun
5 months ago
[-]
> Their cost per Max chip is probably $200-300 which beats the 4090 by a massive margin...

That's true. I was talking about end user pricing.

> ...each chip can do more than a 4090 because it also has a CPU onboard.

That's a strange thing to say. It has a CPU, correct. It makes the chip more versatile but for data center ML tasks it doesn't really matter. A 4090 chip also has much more ML relevant compute per chip. So apple's chips can't really "do more than a 4090" in any relevant way.

Of course apple pays less for their in house made chips vs external products. That comparison doesn't seem relevant to the context, e.g. they're not going to challenging CUDA with internal chips.

They might get more compute per watt though. My guess is that nvidias datacenter chips are competitive in that space, but that's another story.

reply
eek2121
5 months ago
[-]
The GPU in the M-series is much slower than a 4090. 4060-4070ish performance at best, and it varies quite a bit.
reply
ribit
5 months ago
[-]
You need to consider this in the context of the relevant task. Nvidia GPUs have extremely high peak performance for GEMM, but when working with LLMs, bandwidth (and RAM capacity) becomes the limiting factor. There is a reason why real ML-focused datacenter Nvidia GPUs use much wider RAM interfaces and a much higher price point. The M2 Ultra might not have the raw compute, but it has a lot of RAM and large caches.
reply
hajile
5 months ago
[-]
If they can get 5 4070s for the price and power of one 4090, that's a win for them as they'll get more performance per dollar and per watt.
reply
talldayo
5 months ago
[-]
> and per watt

Part of the advantage of using "one 4090" is that the max TDP is only 450w, as opposed to 5 M2 Ultras running at ~150w each. When you scale up to Nvidia's latest Blackwell architecture, I genuinely don't know how Apple could beat them on performance-per-watt. Buying M2 Ultras wholesale is probably cheaper than an NVL72 cluster, but certainly not what you'd want to use for Linux or maximizing AI-based performance-per-watt.

reply
hajile
5 months ago
[-]
You are missing the point. We're discussing if Apple can use their own chips more cheaply than buying Nvidia's chips.

The Max TDP is not actual peak power consumption. Gamer's nexus recorded 500w peak and almost 670w overclocked. Most reviews I've looked at seem to put peak power consumption around 550w.

M2 Ultra wasn't even mentioned and it uses more than 150w. The correct question would be about M3 Max as we have solid numbers on it. M3 Max uses around 100w when both the GPU and CPU are heavily utilized and less than that when only the GPU is used.

This means that Apple could run 5 of their M3 Max chips in the same peak power as the 4090. But wait, there's more. 4090 doesn't run in a vacuum. It requires a separate CPU setup and a couple hundred more watts.

That means we could power 7 or so M3 Max chips with that same amount of power.

Of course, this isn't the whole story. 4090 isn't a professional chip either (while Apple can bin and certify their own CPUs and know they're getting a server-grade chip) and the 4090 also doesn't have nearly enough RAM. H100 starts at $25,000 and goes up. Apple could buy 75-100 M3 Max chips for that kind of money. That's certainly a load more compute than H100 would offer. Blackwell will be even more expensive in comparison.

reply
throwaway4good
5 months ago
[-]
The M2 is a chip designed to be in a laptop (and it is quite powerful given its low power consumption). Presumedly they have a different chip or at least completely different configuration (RAM, network, etc.) in their data centers.
reply
mrweasel
5 months ago
[-]
The interesting point here is that developers targeting the Mac can safely assume that the users will have a processor capable of significant AI/ML workloads. On the Windows (and Linux) side of things, there's no common platform, no assumption that the users will have an NPU or GPU capable of doing what you want. I think that's also why Microsoft was initially going for the ARM laptops, where they'd be sure that the required processing power is available.
reply
qwytw
5 months ago
[-]
> The interesting point here is that developers targeting the Mac can safely assume that the users will have a processor capable of significant AI/ML workloads

Also that a significant proportion (majority?) of them will have just 8 GB of memory which is not exactly sufficient to run any complex AI/ML workloads.

reply
talldayo
5 months ago
[-]
Easy solution; just swap multiple gigabytes of your model to SSD-based ZRAM when you run out of memory. What could possibly go wrong?
reply
skohan
5 months ago
[-]
I believe MS is trying to standardize this, in the same way as they do with DirectX support levels, but I agree it's probably going to be inherently a bit less consistent than Apple offerings
reply
pjmlp
5 months ago
[-]
DirectML can use multiple backends.
reply
InsomniacL
5 months ago
[-]
That sounds like a big issue, but surely assuming for either case is bad.

I expect OS's will expose an API which, when queried, will indicate the level of AI inference available.

Similar to video decoding/encoding where clients can check if hardware acceleration is available.

reply
treprinum
5 months ago
[-]
How does it help me (with maxed out M3 Max) that Apple might have some chip in the future right now? I do DL on A6000 and 4090, not waiting until Apple produces a chip someday that is faster than 1650 in ML...
reply
sofixa
5 months ago
[-]
That's probably where Microsoft's "Copilot+ PCs" come in.
reply
pjmlp
5 months ago
[-]
Plus DirectML, wich as the name implies, builds on top of DirectX, allowing multiple backends, CPU, GPU, NPU.
reply
skohan
5 months ago
[-]
There was a rumor floating around that Apple might try to enter the server chip business with an AI chip, which is an interesting concept. Apple's never really succeeded in the B2B business, but they have proven a lot of competency in the silicon space.

Even their high-end prosumer hardware could be interesting as an AI workstation given the VRAM available if the software support were better.

reply
jitl
5 months ago
[-]
> Apple's never really succeeded in the B2B business

Idk every business I’ve worked and all the places my friends work seem to be 90% Apple hardware, with a few Lenovo issued for special case roles in finance or something.

reply
neodymiumphish
5 months ago
[-]
They mean server infrastructure.
reply
esskay
5 months ago
[-]
Of course you do, Apple's selling mobile SOC's not high end cards. That doesn't mean they're incapable of making them for the right application. You don't seriously think the server farms running on M4 Pro Max chips do you...
reply
pjmlp
5 months ago
[-]
Depends on which card one is talking about.
reply
throwaway4good
5 months ago
[-]
Maybe. It is not really obvious how much you for the AI accellerator part of their offerings. For example the chips in iPhones are quite powerful even adjusted for price. However for some cases - like the max chip in the macbooks or the extra ram - their pricing seems high - maybe even nvidia high.
reply
bayindirh
5 months ago
[-]
I mean, even Apple can't match the markups nVidia has right now. If you break a GPU in your compute server, you wait months for a replacement, and the part is sent back if you can't replace it in five days.

Crazy times.

reply
c0balt
5 months ago
[-]
Enterprise offerings tend to differ. You can get a replacement NVIDIA GPU via a partner, like Lenovo, in 2-3 weeks. And that's on the high side for some support contracts.
reply
bayindirh
5 months ago
[-]
That's from HPE, for an Apollo 6500.
reply
grecy
5 months ago
[-]
I'm wondering how my electricity they will save just from moving from Intel to Apple Silicon in their data centers.
reply
derefr
5 months ago
[-]
I think the thing they're saying that's novel, isn't what they have (LoRAs), but where and when and how they make them.

Rather than just pre-baking static LoRAs to ship with the base model (e.g. one global "rewrite this in a friendly style" LoRA, etc), Apple seem to have chosen a bounded set of behaviors they want to implement as LoRAs — one for each "mode" they want their base model to operate in — and then set up a pipeline where each LoRA gets fine-tuned per user, and re-fine-tuned any time the data dependencies that go into the training dataset for the given LoRA (e.g. mail, contacts, browsing history, photos, etc) would change.

In other words, Apple are using their LoRAs as the state-keepers for what will end up feeling to the user like semi-online Direct Preference Optimization. (Compare/contrast: what Character.AI does with their chatbot response ratings.)

---

I'm not as sure, from what they've said here, whether they're also implying that these models are being trained in the background on-device.

It could very well be possible: training something that's only LoRA-sized, on a vertically-integrated platform optimized for low-energy ML, that sits around awake but doing nothing for 8 hours a day, might be practical. (Normally it'd require a non-quantized copy of the model, though. Maybe they'll waste even more of your iPhone's disk space by having both quantized and non-quantized copies of the model, one for fast inference and the other for dog-slow training?)

But I'm guessing they've chosen not to do this — as, even if it were practical, it would mean that any cloud-offloaded queries wouldn't have access to these models.

Instead, I'm guessing the LoRA training is triggered by the iCloud servers noticing you've pushed new data to them, and throwing a lifecycle notification into a message queue of which the LoRA training system is a consumer. The training system reduces over changes to bake out a new version of any affected training datasets; bakes out new LoRAs; and then basically dumps the resulting tensor files out into your iCloud Drive, where they end up synced to all your devices.

reply
Hugsun
5 months ago
[-]
There is no way they would secretly train loras in the background of their user's phones. The benefits are small compared to the many potential problems. They describe some LoRA training infrastructure which is likely using the same capacity as they used to train the base models.

> ...each LoRA gets fine-tuned per user...

Apple would not implement these sophisticated user specific LoRA training techniques without mentioning them anywhere. No big player has done anything like this and Apple would want the credit for this innovation.

reply
wmf
5 months ago
[-]
I don't think the LoRAs are fine-tuned locally at all. It sounds like they use RAG to access data.
reply
derefr
5 months ago
[-]
Consider a feature from earlier in the keynote: the thing Notes (and Math Notes) does now where it fixes up your handwriting into a facsimile of your handwriting, with the resulting letters then acting semantically as text (snapping to a baseline grid; being reflowable; being interpretable as math equations) but still having the kind of long-distance context-dependent variations that can't be accomplished by just generating a "handwriting font" with glyph variations selected by ligature.

They didn't say that this is an "AI thing", but I can't honestly see how else you'd do it other than by fine-tuning a vision model on the user's own handwriting.

reply
Hugsun
5 months ago
[-]
I didn't see the presentation but judging by your description, this is achievable using in-context learning.
reply
wmf
5 months ago
[-]
For everything other than handwriting I don't think the LoRAs are fine-tuned locally.
reply
derefr
5 months ago
[-]
Well, here's another one: they promised that your local (non-iCloud) photos don't leave the device. Yet they will now — among many other things they mentioned doing with your photos — allow you to generate "Memoji" that look like the people in your photos. Which includes the non-iCloud photos.

I can't picture any way to use a RAG to do that.

I can picture a way to do that that doesn't involve any model fine-tuning, but it'd be pretty ridiculous, and the results would probably not be very good either. (Load a static image2text LoRA tuned to describe the subjects of photos; run that once over each photo as it's imported/taken, and save the resulting descriptions. Later, whenever a photo is classified as a particular subject, load up a static LLM fine-tune that summarizes down all the descriptions of photos classified as subject X so far, into a single description of the platonic ideal of subject X's appearance. Finally, when asked for a "memoji", load up a static "memoji" diffusion LoRA, and prompt it with the that subject-platonic-appearance description.)

But really, isn't it easier to just fine-tune a regular diffusion base-model — one that's been pre-trained on photos of people — by feeding it your photos and their corresponding metadata (incl. the names of subjects in each photo); and then load up that LoRA and the (static) memoji-style LoRA, and prompt the model with those same people's names plus the "memoji" DreamBooth-keyword?

(Okay, admittedly, you don't need to do this with a locally-trained LoRA. You could also do it by activating the static memoji-style LoRA, and then training to produce a textual-inversion embedding that locates the subject in the memoji LoRA's latent space. But the "hard part" of that is still the training, and it's just as costly!)

reply
janekm
5 months ago
[-]
That's going to be something similar to IPAdapter FaceID: https://ipadapterfaceid.com Basically you use a facial structure representation that you'd use for face recognition (which of course Apple already compute on all your photos) together with some additional feature representations to guide the image generation. No need for additional fine-tuning. A similar approach could likely be used for handwriting generation.
reply
gokuldas011011
5 months ago
[-]
I believe this could be achieved by providing a seed image to the diffusion model and generating memoji based on it. This way fine tuning isn't required.
reply
raverbashing
5 months ago
[-]
Yup this is pretty much it, and DALLE and others can do this already
reply
throwthrowuknow
5 months ago
[-]
I think you’re misunderstanding what they mean by adapting to use cases. See this passage:

> The adapter models can be dynamically loaded, temporarily cached in memory, and swapped — giving our foundation model the ability to specialize itself on the fly for the task at hand

This along with other statements in the article about keeping the base model weights unchanged says to me that they are simply swapping out adapters on a per app or per task basis. I highly doubt they will fine tune adapters on user data since they have taken a position against this. I wonder how successful this approach will be vs merging the adapters with the base model. I can see the benefits but there are also downsides.

reply
rvaish
5 months ago
[-]
Easel has been on iMessage for a bit now: https://apps.apple.com/us/app/easel-ai/id6448734086
reply
selimnairb
5 months ago
[-]
Very little of the “AI” boom has been novel, most has been iterative elaborations (though innovative nonetheless). Academics have been using neural network statistical models for decades. What’s new is the combination of compute capability and data volume available for training. It’s iterative all the way down though, that’s how all technologies are developed.
reply
sigmoid10
5 months ago
[-]
Most people don't realize this, but almost all research works that way. Only the media spins research as breakthrough-based, because that way it is easier to sell stories. But almost everything is incremental/iterative. Even the transformer architecture, which in some way can be seen as the most significant architectural advancement in AI in the past years, was a pretty small, incremental step when it came out. Only with a lot of further work building on top of that did it become what we see today. The problem is that science-journalists vastly outnumber scientists producing these incremental steps, so instead of reporting on topics when improvements actually accumulated to a big advancement, every step along the way gets its own article with tons of unnecessary commentary heralding its features.
reply
astrange
5 months ago
[-]
The "bitter lesson of machine learning" means that you actually can't do anything novel; it won't work as well as just doing the simple thing but bigger.

(So there is room left if you're limited by memory or budget.)

reply
w10-1
5 months ago
[-]
> What’s new is the combination of compute capability and data volume available for training

This is the important part.

My advisor said new means old method applied to new data or new method on old data.

Commercially, that means price points, i.e., discrete points where something becomes viable.

Maybe that's iterative, but maybe not. Either way, once the opportunity presents, time is of the essence.

reply
WiSaGaN
5 months ago
[-]
This gives me the vibe of calling high resolution screens as "retina" screens.
reply
dishsoap
5 months ago
[-]
I don't see anything wrong with that at all. They've created a branding term that allows consumers to get an idea of the sort of pixel density they can expect without having to actually check, should they not want to bother.
reply
necovek
5 months ago
[-]
Except that everyone has different visual acuity and different distance they use the same devices at, and in the end, "retina" means nothing at all.

But this is exactly the type of marketing Apple is good at, though "retina" is probably not the most successful example.

reply
theshrike79
5 months ago
[-]
If your "visual acuity" is so good that you can see the pixels of a retina-branded display from the intended viewing distance, you might need to be studied for science.
reply
necovek
5 months ago
[-]
If your visual acuity is 20/10, you'd roughly need 3600 pixels vertically to not notice any pixelation if Bill Otto did the calculations right at https://www.quora.com/What-resolution-does-20-10-vision-corr...

20/10 is rare but can easily be corrected to with glasses or contacts.

You also left that "intended viewing distance" hanging there, without at all acknowledging what that is at a minimum?

reply
jackothy
5 months ago
[-]
It's not so impossible to spot flaws if you're using worst-case testing scenarios. Which are not worthless because such patterns do actually pop up in real world usage, albeit rarely.
reply
kolinko
5 months ago
[-]
Examples?
reply
jackothy
5 months ago
[-]
Had one happen to me recently where I was scrolling Spotify, and they do the thing where if you try to scroll past max they will stretch the content.

One of the album covers being stretched had some kind of fine pattern on it that caused a clearly visible shifting/flashing Moiré pattern as it was being stretched.

Wish I could remember what album cover it was now.

Though really it's simple enough: As long as you can still spot a single dark pixel in the middle of an illuminated white screen, the pixels could benefit from being smaller. (Edit: swapped black and white)

reply
kolinko
5 months ago
[-]
The single pixel example is wrong I think, because that’s not how light and eyes work - you can always spot a single point of light, regardless of how small - if it’s bright enough.
reply
jackothy
5 months ago
[-]
Yes exactly that's why I went back right away to edit - if the whole screen is white but one pixel is off and you can spot it, then my logic holds.
reply
ngcc_hk
5 months ago
[-]
Agreed. It is not high resolution as such, but high resolution that the user can relate to - like cannot see the pixel.

Still remember the hard time using Apple newton in a conference vs the palm freely on loan in a Gartner group conference. Palm solved a problem, even though not very Apple … user can input on a small device. I kept it, on top of my newly bought newton.

It is the user …

reply
pyinstallwoes
5 months ago
[-]
Still no manufacturer compares to the quality of apple screens and resolution …
reply
Malcolmlisk
5 months ago
[-]
Those screens are produced by samsung.
reply
PaulRobinson
5 months ago
[-]
By your logic, I own a Foxconn smartphone with a FreeBSD-based OS. If you bought a Porsche, would you call it a Volkswagen?
reply
lostmsu
5 months ago
[-]
Perhaps you should.
reply
mensetmanusman
5 months ago
[-]
Part of the screen is, yes. Apple designs the full stack and sources new technology from multiple suppliers including Samsung.
reply
astrange
5 months ago
[-]
Except for when it's LG, Sharp or BOE.
reply
dwaite
5 months ago
[-]
Or AMD calling monitors which meet quality and feature requirements 'FreeSync'

Or Intel calling USB4 devices and cables which meet quality and feature requirements 'Thunderbolt 5'

Compared to, say, manufacturers who aren't willing to meet any certification requirements or to properly implement the standards at play saying they have "USB-A 3.2 2x2 ports" on their motherboards.

Retina doesn't carry the same weight as an industry certification effort like thunderbolt, but it still informs people that a screen actually meets some sort of bar without them having to evaluate pages of tech specs, and reviews saying whether the tech specs are accurate or have undocumented caveats.

Finally, establishing such certifications are difficult - look at the number of failed attempts at creating industry quality/feature marks in the television market.

reply
viktorcode
5 months ago
[-]
Retina means high pixel density, not high resolution. And there are very few standalone displays on the market which can be called “retina”, unfortunately.
reply
necovek
5 months ago
[-]
Interestingly, I would venture to say that DPI is a measure of resolution: that's the way it's still used in printers or scanners, for example (600 dpi). And retina instead means high angular resolution, or pixels having small arc measures from "appropriate" distances.

The term "resolution" transitioned gradually to mean number-of-individual-lines-rendered horizontally and vertically for displays, but really, the idea is how many dots can you "resolve" (or "resolving power"): a "high resolution" screen or screen mode had a larger number of individual pixels being drawn, which meant that the density is higher. You never talk about a scanner having a resolution of 7200x3600 even if it can scan 12"x6" at 600dpi.

So really, in an informal conversation, I believe both are fine. If you want to be extremely precise, you missed the mark: width and height in pixels is the sanest way to call what you refer to as "resolution".

reply
necovek
5 months ago
[-]
See also https://en.wikipedia.org/wiki/Display_resolution:

> For device displays such as phones, tablets, monitors and televisions, the use of the term display resolution as defined above is a misnomer, though common. The term display resolution is usually used to mean pixel dimensions, the maximum number of pixels in each dimension (e.g. 1920 × 1080), which does not tell anything about the pixel density of the display on which the image is actually formed: resolution properly refers to the pixel density, the number of pixels per unit distance or area, not the total number of pixels. In digital measurement, the display resolution would be given in pixels per inch (PPI).

reply
threeseed
5 months ago
[-]
It's a huge development in terms of it being a consumer-ready, on-device LLM.

And if Karpathy thinks so then I assume it's good enough for HN:

https://x.com/karpathy/status/1800242310116262150

reply
rishabhjain1198
5 months ago
[-]
The productization of it (like Karpathy mentioned) is awesome. But I think the URL for that would be this maybe? [link](https://www.apple.com/apple-intelligence/)
reply
Cthulhu_
5 months ago
[-]
Thing is, Apple takes these concepts and polishes them, makes them accessible to maybe not laypeople but definitely a much wider audience compared to those already "in the industry", so to speak.
reply
marcellus23
5 months ago
[-]
They refer to LoRA explicitly in the post.
reply
rishabhjain1198
5 months ago
[-]
Although I caught that on the first read, I found myself questioning when I read the adaptors part, "is this not just LoRA...".

Maybe it's my fault as a reader, but I think the writing could be clearer. Usually in a research paper you would link to the LoRA paper there too.

reply
monkeydust
5 months ago
[-]
Feel Apple should have just focused on their models for this one and not complicate the conversation with OpenAI. They could have left that to another announcement later.

Quick straw poll survey around the office, many think their data will be sent off to OpenAI by default for these new features which is not the case.

reply
scosman
5 months ago
[-]
I think you’re referring to my comment about this being huge for developers?

Just want to point out I call this launch huge, didn’t say “huge development” as quoted, and didn’t imply what was interesting was the ML research. No one in this thread used the quoted words, at least that I can see.

My comment was about dev experience, memory swapping, potential for tuning base models to each HW release, fine tune deployment, and app size. Those things do have the potential to be huge for developers, as mentioned. They are the things that will make a local+private ML developer ecosystem work.

I think the article and comment make sense in their context: a developer conference for Mac and iOS devs.

Apple also explicitly says it’s LoRA.

reply
franzb
5 months ago
[-]
This isn't about AI research, it's about delivering AI at unimaginable scale.
reply
throwthrowuknow
5 months ago
[-]
180 million users for chatgpt isn’t unimaginable but it does exceed the number of iPhone users in the United States.
reply
frompom
5 months ago
[-]
Do you have the same expectations for any company launching hardware that they cite the various papers related to how the tech was developed? EVERY piece of tech announced by ANY company relies on a variety of research out there yet it doesn't seem expected that every time anyone launches something they cite the numerous papers related to that. Why would products/services in this category be any different?
reply
talldayo
5 months ago
[-]
> Do you have the same expectations for any company launching hardware that they cite the various papers related to how the tech was developed?

If they try to market it with a seemingly unique or yet-unheard of name, then yeah. It is nice knowing what the "real world" name of an Apple-ized technology is.

Just ignoring it and marketing the technology under some new name is adjacent to lying to your audience through omission.

reply
dwaite
5 months ago
[-]
> Just ignoring it and marketing the technology under some new name is adjacent to lying to your audience through omission.

They don't market technology, they market solutions. E.g. afib detection on Apple Watch, rather than calling it a BNNS using a custom-built sensor for one-wire EKG.

This is the document where they describe how the solution works, and they clearly state adapters work based on LoRA.

reply
arvinsim
5 months ago
[-]
> The way they wrote about "adaptors" can make it seem like it's something novel, but it's actually just re-iterating vanilla LoRA.

That's a classic Apple strategy though.

reply
gigglesupstairs
5 months ago
[-]
Was there anything about searching through our own photos using prompts? I thought this could be pretty amazing and still a natural way to find very specific photos in one’s own photo gallery.
reply
mazzystar
5 months ago
[-]
Run OpenAI's CLIP model on iOS to search photos. https://github.com/mazzzystar/Queryable
reply
gigglesupstairs
5 months ago
[-]
Yes, exactly this. I have had this for a while and works wonderfully well in most cases but it’s wonky and not seamless. I wanted a more integrated approach with Photos app which only Apple can bring to the table.
reply
avereveard
5 months ago
[-]
Which is in turn just multimodal embedding

Besides I could do "named person on a beach in August" and get the correct thing in photos on Android photos, so I don't get it.

It's amazing for apple users if they didn't have it before. But from a tech stand point people could have had it for a while.

reply
azinman2
5 months ago
[-]
Photos has had this for a while with structured natural language queries, and this kind of prompt was part of the WWDC video.
reply
theshrike79
5 months ago
[-]
The difference is that Apple has been doing this on-device for maybe 4-5 years already with the Neural Engine. Every iOS version has brought more stuff you can search for.

The current addition is "just" about adding a natural language interface on top of data they already have about your photos (on device, not in the cloud).

My iPhone 14 can, for example, detect the breed of my dog correctly from the pictures and it can search for a specific pet by name. Again on-device, not by sending my stuff to Google's cloud to be analysed.

reply
fauigerzigerk
5 months ago
[-]
They have been trying and failing to do a tiny little bit of this. It's so broken and useless that I've been uploading all my iCloud photos to Google as well, for search and sharing.
reply
theshrike79
5 months ago
[-]
If you like Google using your personal photos for machine learning, that's your option. Now they have your every photo, geotagged and timestamped so they can see where you have been and at what times. Then they of course anonymise that information into an "advertiser id" they tag on to you and a sufficient quantity of other people so they can claim they're not directly targeting anyone.

I prefer Apple's privacy focused option myself.

reply
fauigerzigerk
5 months ago
[-]
>If you like Google using your personal photos for machine learning, that's your option.

It's a trade-off between getting the features I need and the price I have to pay. All else being equal I do prefer privacy as well. Unfortunately, all else is not equal.

>I prefer Apple's privacy focused option myself.

It's only an option if it works.

reply
lhl
5 months ago
[-]
I think your conclusion is uncharitable or at least depends on how deep your interest in AI research actually is. Reading the docs, there are at least several points of novelty/interest:

* Clearly outlining their intent/policies for training/data use. Committing to no using user data or interactions for training their base models is IMO actually a pretty big deal and a differentiator from everyone else.

* There's a never-ending stream of new RL variants ofc, but that's how technology advances, and I'm pretty interested to see how these compare with the rest: "We have developed two novel algorithms in post-training: (1) a rejection sampling fine-tuning algorithm with teacher committee, and (2) a reinforcement learning from human feedback (RLHF) algorithm with mirror descent policy optimization and a leave-one-out advantage estimator. We find that these two algorithms lead to significant improvement in the model’s instruction-following quality."

* I'm interested to see how their custom quantization compares with the current SoTA (probably AQLM atm)

* It looks like they've done some interesting optimizations to lower TTFT, this includes the use of some sort of self-speculation. It looks like they also have a new KV-cache update mechanism and looking forward to reading about that as well. 0.6ms/token means that for your average I dunno, 20 token query you might only wait 12ms for TTFT (I have my doubts, maybe they're getting their numbers from much larger prompts, again, I'm interested to see for myself)

* Yes, it looks like they're using pretty standard LoRAs, the more interesting part is their (automated) training/re-training infrastructure but I doubt that's something that will be shared. The actual training pipeline (feedback collection, refinement, automated deployment) is where the real meat and potatoes of being able to deploy AI for prod/at scale lies. Still, what they shared about their tuning procedures is still pretty interesting, as well as seeing which models they're comparing against.

As this article doesn't claim to be a technical report or a paper, while citations would be nice, I can also understand why they were elided. OpenAI has done the same (and sometimes gotten heat for it, like w/ Matroyshka embeddings). For all we know, maybe the original author had references, or maybe since PEFT isn't new to those in the field, that describing it is just being done as a service to the reader - at the end of the day, it's up to the reader to make their own judgements on what's new or not, or a huge development or not. From my reading of the article, your conclusion, which funnily enough is now the new top-rated comment on this thread isn't actually much more accurate the the one old one you're criticizing.

reply
steve1977
5 months ago
[-]
You know what company you are talking about here?
reply
rvaish
5 months ago
[-]
reminds me of Easel on iMessage: https://easelapps.ai/
reply
Spooky23
5 months ago
[-]
Those people aren’t looking at Apple.

They seem to have a good model for adding value to their products without the hold my beer, conquer the world bullshit that you get from OpenAI, et al.

reply
kfrzcode
5 months ago
[-]
"AI for the rest of us."
reply
wkat4242
5 months ago
[-]
Except Apple isn't really for the rest of us. Outside of America and a handful wealthy western countries it's for the top 5-20% earners only.
reply
jahewson
5 months ago
[-]
Approximately 33% of all smartphones in the world are iPhones.
reply
rvnx
5 months ago
[-]
60% in the US
reply
dwaite
5 months ago
[-]
Maybe the suspicion is the top 5% carry six smartphones?
reply
throwaway2037
5 months ago
[-]
Japan and Taiwan are both more than 50% iOS.

Ref: https://worldpopulationreview.com/country-rankings/iphone-ma...

reply
whynotminot
5 months ago
[-]
Who do you think this presentation is geared toward?
reply
theshrike79
5 months ago
[-]
In the EU the market share is 30%
reply
d1sxeyes
5 months ago
[-]
Yes but not evenly distributed, BeNeLux, Germany, Austria, and Nordic countries have a lot of iPhone users, while moving further east (or south) you see lower market share. Maybe it’s “two handfuls” of wealthy western countries rather than just one, but I think OPs point holds true.
reply
theshrike79
5 months ago
[-]
https://worldpopulationreview.com/country-rankings/iphone-ma...

Poland, Greece, Hungary and Bosnia-Herzegovina are the only ones under 20% (and maybe a few others).

OTOH Britain is over 50% as is Sweden. Finland, the land of Nokia is over 35%.

reply
XajniN
5 months ago
[-]
One average American user is probably worth 5-10 average European users.
reply
theshrike79
5 months ago
[-]
(I've dabbled in mobile games)

Yes. Americans are THE most valuable customer base, y'all use insane amounts of money on mobile crap.

reply
dwaite
5 months ago
[-]
And non-mobile crap. I recently purchased a structure for my backyard on my phone.
reply
kolinko
5 months ago
[-]
In Poland it’s 33%
reply
d1sxeyes
5 months ago
[-]
reply
kolinko
5 months ago
[-]
Ah, I stand corrected. Not sure where I found that high number.
reply
elbear
5 months ago
[-]
In Romania it's 24.7%
reply
d1sxeyes
5 months ago
[-]
Huh interesting, I missed that. You’re right (actually I see even 25.5%).
reply
chuckjchen
5 months ago
[-]
This sounds like every newcomers to the stage except for big players like Apple.
reply
cube2222
5 months ago
[-]
Halfway down the article contains some great charts with comparisons to other relevant models, like Mistral-7B for the on-device models, and both gpt-3.5 and 4 for the server-side models.

They include data about the ratio of which outputs human graders preferred (for server side it’s better than 3.5, worse than 4).

BUT, the interesting chart to me is „Human Evaluation of Output Harmfulness” which is much, much ”better„ than the other models. Both on-device and server-side.

I wonder if that’s part of wanting to have gpt as the „level 3”. Making their own models much more cautious, and using OpenAI’s models in a way that makes it clear „it was ChatGPT that said this, not us”.

Instruction following accuracy seems to be really good as well.

reply
crooked-v
5 months ago
[-]
I want to know what they consider "harmful". Is it going to refuse to operate for sex workers, murder mystery writers, or people who use knives?
reply
hotdogscout
5 months ago
[-]
I bet it's the usual double standards the AI one percenters cater to.

No sex because apparently it's harmful yet never explained why.

No homophobia/transphobia if you're Christian but if you're Muslim it's fine.

reply
m463
5 months ago
[-]
Bet it depends on the country.

In the USA, you won't be able to ask about sex, but you can probably ask about tank man.

reply
jeroenhd
5 months ago
[-]
I would've thought the same until Microsoft started hiding tank man results in Bing. I'm not so sure if companies will start training different models for every oppressive regime.
reply
t-writescode
5 months ago
[-]
Oh?

https://www.bing.com/images/search?q=tank%20man

Looks visible, to me.

Tiananmen Square even shows Tank Man on the first page, 13th and 15th entry, for me. Admittedly, I expected it more quickly on Tiananmen Square; but that might be because I, as a person, forgot that it's also a literal square with more stuff going on at it than a single moment in history.

reply
lelandfe
5 months ago
[-]
2021, a classic, “we made a whoopsie” response.

https://theguardian.com/technology/2021/jun/04/microsoft-bin...

reply
arthur_sav
5 months ago
[-]
They'll inject whatever ideology / dogma is "the current thing" into this.
reply
rvnx
5 months ago
[-]
"Think Different"
reply
its_ethan
5 months ago
[-]
The caption for the image gives a little more insight into "harmful" and one of the things it mentions is factuality - which is interesting, but doesn't reveal a whole lot unless they were to break it out by "type of harmful".
reply
Aerbil313
5 months ago
[-]
None of the use cases they presented in WWDC using Apple Intelligence was creative writing. There is one, that uses ChatGPT explicitly:

> And with Compose in Writing Tools, you can create and illustrate original content from scratch.

https://www.apple.com/apple-intelligence/

reply
causal
5 months ago
[-]
Refusing to answer any question would result in a perfect score for the first chart since it says nothing of specificity
reply
tonynator
5 months ago
[-]
So it's not going to be better than other models, but it will be more censored. I guess that might be a selling point for their customer base?
reply
dghlsakjg
5 months ago
[-]
iPhone share is ~59% of smartphones in the US.

Their customer base is effectively all demographics.

reply
tonynator
5 months ago
[-]
Those who dislike censorship and enjoy hacking avoid iPhones for obvious reasons.
reply
overstay8930
5 months ago
[-]
People who understand cybersecurity hygiene use iPhones for obvious reasons
reply
Hugsun
5 months ago
[-]
The reasons are very not obvious to me. Could you elaborate?
reply
overstay8930
5 months ago
[-]
There is no other ecosystem that will provide easy E2EE in the cloud, that’s reason enough for the vast majority of users who are just sick of breaches because of pure sysadmin incompetence.

There are plenty of examples in every Apple service or even their accessories (i.e. wireless keyboard encryption). FaceTime is hardened from even theoretical attacks that could probably only be performed by 5 eyes like transcribing E2EE calls based on bandwidth use (FaceTime has built mitigations around this attack vector).

reply
astrange
5 months ago
[-]
iPhone security is very good, better than, say, your desktop even though you don't carry your desktop everywhere you go. (Some Android security is also very good, depends on the hardware though.)
reply
Aerbil313
5 months ago
[-]
People who understand cybersecurity who are not operating within a US-allied country use ... I don't know what to be honest. What to do in such a situation, where Apple is a US-based company obligated by law to comply with requests from three letter agencies and Android is a buggy mess which probably is backdoored by every major power?
reply
astrange
5 months ago
[-]
Your threat model shouldn't be about three letter agencies unless you're running a terrorist group. It should be about porn spammers or something.
reply
Der_Einzige
5 months ago
[-]
I can assure you that if you’re doing AI research, glowies likely spy on you, but it’s the “good” kind of spying where you might get a job offer some day instead of the kind where they Guantanamo bay you…
reply
Aerbil313
5 months ago
[-]
Unless of course you are not based in a US-allied country, as stated very explicitly in my comment.

Ah the neverending American centricism on this site.

reply
Aerbil313
5 months ago
[-]
Do you mean that US is the measure of justice and goodness in the world and anyone who opposes her is a terrorist? I don't mean to detract the conversation but this is really victim blaming. There are valid situations where the US DoD is a threat.

Your username reminded me of Julian Assange, for a second I mixed them up.

reply
astrange
5 months ago
[-]
No, I assume if you're working for a foreign military then you wouldn't be posting about it on here.
reply
Aerbil313
5 months ago
[-]
I assume if I was working for a foreign power I'd already know the answer to my original question... unless there's no good answer.
reply
overstay8930
5 months ago
[-]
Sorry to break it to you but you cannot hide from the US government if you use modern hardware or connect to the internet in any way.
reply
Aerbil313
5 months ago
[-]
How likely is it that seemingly well-intentioned companies/open source projects like Mullvad, Wireguard, Qubes OS, Monero etc. are infiltrated by the US government?

Or are you talking about the presence of Intel ME and the like in modern hardware?

reply
overstay8930
5 months ago
[-]
I’m talking about the simple fact that Five Eyes have so many resources put together that you will never hide forever, unless you are willing to drastically change your life.

The good news is you are not important enough for them to care about you unless you are an Iranian general with nuclear access or some shit. Even these ransomware groups aren’t even on the radar of the people who are actually being targeted, I’m talking about stopping the yakuza trying to sell nukes to terrorists level of threat.

reply
Aerbil313
5 months ago
[-]
I appreciate you, and thanks for the "You're not that important" reminder for the average reader of this comment. However I must state I personally really am in need of an answer. Hypothetically, what if I am that important? What if I am not yet a target but think there's a decent chance that I will be targeted by nation-states, the most capable being US one day? You seem pretty knowledgeable. In the age of ML-assisted mass data scanning and ISP data collection & correlation, what would the lifestyle changes I must undergo to remain truly pseudonymous on the internet look like?
reply
realusername
5 months ago
[-]
Those cybersecurity experts are using GrapheneOS and certainly not an iPhone where they can't even check if all is going well...
reply
jitl
5 months ago
[-]
I’ve worked with a few people from NCC Group, Matasano, security staff at Airbnb and OpenAI, all carry and recommend iPhones for security footing. Depending on threat model, “lockdown mode” on iOS has a lot of what is useful in Grapheme like turning off built in connectivity services & disabling JIT and other code paths in the webview.
reply
realusername
5 months ago
[-]
To each their own, I certainly would not recommend an iPhone in this scenario, especially even more at a top tech company.
reply
binkethy
5 months ago
[-]
It would seem that integrating a backdoor funnel to OpenAI is a bit of a security issue to those who care about such things.

Yay, we can all train corporate models for free involuntarily.

I guess it's time to check out Lineage OS and Postmarket OS. It was always a matter of time.

reply
scarface_74
5 months ago
[-]
No one is going to train an AI on random user generated data. The data is going to be horrible and it’s going to be full of PII that’s too risky to expose.
reply
duxup
5 months ago
[-]
I feel that way, have an iPhone.
reply
ihumanable
5 months ago
[-]
Yea, same.

I have literally no desire to hack and fuck around with my personal cell phone, doing so would take away time from the hacking I actually want to do.

reply
tonynator
5 months ago
[-]
One option you can't hack on, the other you can but don't have to at all. Why choose the former? Are you so sure you'll never want to do anything on your phone that's unapproved by your corporate overlords?
reply
gfourfour
5 months ago
[-]
How does an iPhone contribute to censorship?
reply
tonynator
5 months ago
[-]
reply
ksec
5 months ago
[-]
I hope, this could mean Apple will push the baseline of ALL Macs to have higher than 8GB of Memory. While I wish we all get 16GB M4 as baseline. Apple being Apple may only give us 12GB, and charges extra $100 for the 16GB option.

It will still be a lot better than 8GB though.

reply
talldayo
5 months ago
[-]
The Steam Deck ships with 16 gigs of quad-channel LPDDR5 and it costs $400. Apple knows exaaaactly what they're doing with this sort of pricing.

Can't forget about that cozy 256gb SSD either. An AI computer will need more than that, right?

reply
wraptile
5 months ago
[-]
RAM is literally the cheapest primary component in a laptop at going rate of 1-4usd/GB. I'd say that shipping 8GB base model in 2024 is clearly manipulation by Apple, i.e. planned obsolescence or a way to moat Apple software. Anyone who doesn't see this is just being delusional.

Same way Apple and Samsung ship 128GB of storage when the production price between 128gb and 1tb is like 10$ (on a 1000$ device). Samsung even got rid of micro sd slot. It's so blatant it's actually depressing.

reply
glial
5 months ago
[-]
> RAM is literally the cheapest primary component

Is that still true for Apple's integrated memory? It might be - I just don't know.

reply
talldayo
5 months ago
[-]
For the LPDDR4 and LPDDR5 that goes into the M1 and M2/M3 systems, yes. You might need to spend more money on memory controllers (since M1 and up is 8-channel) but the physical memory component itself is highly availible and relatively cheap. Same goes for SSD storage, nowadays.
reply
p_l
5 months ago
[-]
The memory used by Apple isn't anything magical or special - it's bog standard LPDDR5, essentially same as phone - and in a laptop it's way less limited by thermal and power constraints to add more (which is how you have the rather large possible set of options).

While going for the top tier of memory sizes Apple offers does cost considerable amounts, making 16, or even 32GB standard is peanuts.

reply
rfoo
5 months ago
[-]
> integrated memory

Yes. The cost of bonding memory to their chip is mostly the same for 8G / 16G / 32G / practically any number.

reply
PaulRobinson
5 months ago
[-]
On the number of devices they sell, an extra $64 of cost per device (taking your higher figure and assuming an extra 8GB), across Mac, iPad and iPhone, they'd be looking at a cost of ~$12.8bn a year. If they just did it for Mac, it's still in the region of $2bn/year.

Sure they could pass that onto a mostly price-insensitive audience, but they like round numbers, and it's not the size of decision you take without making sure its necessary: that your customers are going to go elsewhere in either scenario of doing it or not doing it.

reply
pjmlp
5 months ago
[-]
Typing this from a Samsung with SD slot, need to chose your models wisely.
reply
zer0zzz
5 months ago
[-]
Is steamdeck sold at cost? From what I know Apple has a rule that everything must be sold at 40% margins. That is prob the main reason.
reply
makeitdouble
5 months ago
[-]
> From what I know Apple has a rule that everything must be sold at 40% margins.

As for all rules, it's a rule except when it's not. On the top of my head Apple TV [0] had a 20% predicted margin presumably because they wanted to actually sell them.

Otherwise 40% margin is usually calculated against the BOM, which doesn't mean 40% of actual profit when the product is sold.

In that respect we have no idea of the actual margin on a macbook air for instance, it could be 10% when including their operating costs and marketing, or it could 60% if they negociated prices way below the estimated BOM for instance.

It's just to say: Apple sells at 8Gb because they want to, at the end of the day nothing is stopping them to play with their margin or the product price.

[0] https://www.reuters.com/article/idUSN06424767/

reply
talldayo
5 months ago
[-]
As a consumer I really cannot be made to care why it's the case. This artificial price tiering is stupid and everyone has been calling it a scam for years. Apple clearly knows they're in the wrong, but continues because they know nobody can stop them.
reply
andruby
5 months ago
[-]
From a business perspective it's not _stupid_. It sucks for us customers, but it's "smart" from a business point of view.

Until they get serious competition, I doubt they'll change their practices.

And while I hate the overpriced memory upgrades, I still prefer paying extra, rather than Apple switching to a Ad-based business model like Google (and potentially OpenAI in the future)

reply
talldayo
5 months ago
[-]
> From a business perspective it's not _stupid_. It sucks for us customers, but it's "smart" from a business point of view.

Well, I'm not a business. I appreciate smart consumer choices and I applaud any company that doesn't have to be forced into doing the right thing.

> I still prefer paying extra, rather than Apple switching to a Ad-based business model like Google (and potentially OpenAI in the future)

Oh you sweet summer child. You think Apple doesn't also have an ad-based business model on top of that?

I switched to Linux after MacOS Mojave, and I do not miss any of this brouhaha one bit. It's almost rich hearing people talk about how few ads MacOS has, when it's constantly begging you to try or pay for Apple software services. Even Android isn't as ad-ridden as MacOS, the only victory Apple can claim is relative to Windows (which is a grim reflection of MacOS's eventual service-dominated fate).

You should try out Linux, though. It's a culture shock, trying to get work done with no inbuilt advertisement whatsoever. I could never go back to Mac or Windows and be this productive.

reply
scottyah
5 months ago
[-]
Business and consumers both benefit from not using a cost-plus pricing model in a marketspace heavily dependent on R&D. Sure, the profit margins are hard to agree upon since it's an infinite scale and they change heavily day to day, but do not trust anyone who thinks all product pricings should be based solely on cost of materials.
reply
pjmlp
5 months ago
[-]
Yes they can, buy something else.
reply
talldayo
5 months ago
[-]
I've been doing that for six fucking years and not a single thing has changed. The base memory and storage has not changed in that time. During that same amount of time:

- Macs transitioned to Apple Silicon, got rid of dGPU memory

- Baseline Macbook Air models increased in price by $100

- AI became a realistic and usable technology

- Gaming is slightly feasible with GPTK

Of course we shouldn't be starting at 8 gigs of memory. This is highway robbery and the only thing you can say in defense is "buy something else then"

reply
pjmlp
5 months ago
[-]
When I know I will get robbed in the highway, I take another safer road instead, instead of playing lucky and complaining afterwards.
reply
brandall10
5 months ago
[-]
It's been speculated that base config macbooks essentially act as loss leaders for higher end configs, so overall, probably sales across the line net somewhere around that. The cost of the upgrades themselves can get to multiple times the actual market cost.
reply
breuleux
5 months ago
[-]
I feel like this is very obviously what they are doing: they have a small margin on the entry config where people are a lot more price-aware, and jack up the upgrade costs to get a fat margin on configs bought by orgs and power users who generally care less about how much it costs. Frankly, I'd do the same thing.
reply
scottyah
5 months ago
[-]
I always thought the products like the $700 computer wheels[1] and $1,000 monitor stands[2] are made exclusively for the companies who are trying to signal their wealth. For anyone who knows how much these accessories cost, walking into an office where all the workstations have these components puts out a message of wealth (much like standing desks, Herman Miller chairs, expensive art, super espresso machines, etc).

I don't think it tarnishes their reputation as long as the products are both actually good quality and there are inexpensive alternatives. Why not get some easy cash while letting your customers display their vanity? It reminds me of the "I am rich" app[3].

[1] https://www.apple.com/shop/product/MX572ZM/A/apple-mac-pro-w... [2] https://www.apple.com/shop/product/MWUG2LL/A/pro-stand [3] https://en.wikipedia.org/wiki/I_Am_Rich

reply
talldayo
5 months ago
[-]
> Why not get some easy cash while letting your customers display their vanity?

Because computer wheels and monitor stands are high-demand utilities, not jewelry or makeup.

reply
creshal
5 months ago
[-]
40% margin on parts that cost tens of dollars isn't going to have a huge impact on the sticker price of devices costing hundreds to thousands.
reply
torginus
5 months ago
[-]
I remember hearing that Apple's researching running AI models straight from flash storage (which would make immense amount of sense imo). You could create special, high read bandwidth flash chips (which would probably involve connecting a fast transciever in parallel to the 3D flash stack).

If you could do that, you could easily get hundreds of GB/s read speed out of simple TLC flash.

Obviously this is the future, but I think it's a promising one.

reply
dgellow
5 months ago
[-]
The have insane margin on ram and storage, it would be really surprising to see them move away from their current strategy
reply
keyle
5 months ago
[-]
It probably will change. Note that, so far, a 16GB apple device has much better usability than the equivalent on windows. This may sound biased, but the memory compression and foreground/background actions by macOS tight integration with the hardware is really good. I've never felt like I couldn't do things on smaller hardware, except (larges) LLMs.

Also when I compare with my co-workers the memory pressure is a lot less running the same software on macOS than Windows. This might have to be due to the UI framework at play.

But that said, I totally agree that Apple is doing daylight robbery with their additional RAM pricing, and the minimum on offer is laughable.

reply
Aeolun
5 months ago
[-]
Any apple device has much better usability than a windows machine, regardless of RAM.
reply
wwtrv
5 months ago
[-]
> This may sound biased,

It certainly does, close to irrational even. IIRC memory compression is enabled by default on Windows as well.

reply
dialup_sounds
5 months ago
[-]
Biased and irrational are both things HN readers say to avoid using the word "subjective".
reply
snemvalts
5 months ago
[-]
The swapping is indeed faster as the SSD is on the SoC and so fast to access. To the point that an 4 year old 8gb M1 Air is enough for simpler development work, at least for me.
reply
andreasmetsala
5 months ago
[-]
SSD on chip might be a thing one day but I’m pretty sure only the RAM is on the same chip.
reply
pests
5 months ago
[-]
I would think any 4 year old 8gh laptop would be enough for simpler development work.
reply
vishnugupta
5 months ago
[-]
Does it matter what the baseline memory is as long as they have 16GB M4 as an option?
reply
manmal
5 months ago
[-]
Some companies give their employees only base models.
reply
joshstrange
5 months ago
[-]
Some companies are stupid, news at 6.

It's not Apple's (or any computer manufacturer's) responsibility to put out products that can solve every problem with the base model.

I'll never understand why companies pay high salaries then give employees sub-optimal computers to do their job.

reply
manmal
5 months ago
[-]
I understand your frustration, but I don’t appreciate the sarcasm in your response.
reply
ndgold
5 months ago
[-]
Absolutely awesome amount of content in these two pages. This was not expected. It is appreciated. I can’t wait to use the server model on a Mac to spin up my own cloud optimized for the Apple stack.
reply
solarkraft
5 months ago
[-]
What makes you think you'll get that model?

Edit: I see they're committing to publishing the OS images running on their inference servers (https://security.apple.com/blog/private-cloud-compute/). Would be cool if that allowed people to run their own.

reply
msephton
5 months ago
[-]
Apparently they will in a VM but it seems perhaps only security researchers?
reply
whazor
5 months ago
[-]
It would be much cooler if enterprises can swap to their custom models in their own clouds.
reply
rekoil
5 months ago
[-]
> Would be cool if that allowed people to run their own.

Oh my god that would be absolutely amazing!

reply
titaniumtown
5 months ago
[-]
Did it mentioned being able to spin up the server model locally? I must've missed that part in the article.
reply
theshrike79
5 months ago
[-]
They didn't but I'll bet it's coming in the next 5 years.

Most likely integrated with an Apple TV or a similar thing. Enough local LLM processing power to handle a family's data all in-house.

reply
vzaliva
5 months ago
[-]
I love that they use machinelearning.apple.com not ai.apple.com
reply
tmpz22
5 months ago
[-]
For the majority of the keynote they explicitly avoided the word AI instead substituting the word Intelligence, then Apple Intelligence, and then towards the end they said AI and ChatGPT once or twice.

I think they saw the response to all the AI shoveling and Microsoft Recall and executed a fantastic strategy to reposition themselves in industry discussions. I still have tons of reservations about privacy and what this will all look like in a few years, but you really have to take your hat off to them. WWDC has been awesome and it makes me excited to develop for their platform in a way I haven't felt in a very, very, long time.

reply
worstspotgain
5 months ago
[-]
> executed a fantastic strategy to reposition themselves in industry discussions

Just the usual marketing angle, IMO. It's not TV, it's HBO.

No one is reluctant to use the word smartphone to include iPhones. I don't think anyone is going to use the Apple Intelligence moniker except in the same cases where they'd say iCloud instead of cloud services.

It's also a little clunky. Maybe they could have gone with... xI? Too close to the Chinese Xi. iAI? Sounds like the Spanish "ay ay ay." Not an easy one I think. The number of person-hours spent on this must have been something.

reply
tmpz22
5 months ago
[-]
I don't think they actually expect "Apple Intelligence" to enter popular vernacular. I think it was more to drive home the distinction between what Apple is doing and what everybody else is doing.
reply
andsoitis
5 months ago
[-]
> distinction between what Apple is doing and what everybody else is doing

it is artificial intelligence, applied intelligently.

In Apple's case: "personalised AI system"

reply
swyx
5 months ago
[-]
correct. last year instead of VR they went with Spatial Intelligence
reply
hbn
5 months ago
[-]
Vision Pro isn't really designed to be a VR device first and foremost. The primary usecase is the passthrough mode whereas VR usually describes the software putting you in a different place.
reply
spogbiper
5 months ago
[-]
"spatial computing"
reply
seydor
5 months ago
[-]
> makes me excited to develop for their platform in a way I haven't felt in a very, very, long time

AI will ultimately do all the 'development', and will replace all apps. The integrations are going to be a temporary measure. Only apps that will survive are the ones that control things that apple cannot control (ie. how Uber controls its fleet)

reply
Hugsun
5 months ago
[-]
Perhaps. It will be exciting to see if/how that happens. It does seem relatively far off still. At least some years.
reply
dgellow
5 months ago
[-]
What excites you specifically as a developer?
reply
tmpz22
5 months ago
[-]
For the last 10 years I've been a full stack / devops developer. I think that ecosystem is in a very bad place right now and has failed to efficiently modernize. The tools that people are adopting in an attempt to mitigate this such as NextJS are still grounded in the complexities of the past like Node/React/Express/Serverless and not good enough.

These troubles metastasize to subpar SaaS products, low efficiency, bad company cultures, layoffs, bad hiring practices, management instead of leadership, salary stagnation, dark patterns, you name it.

So to see Apple with a laser focus on tooling, quality of life, privacy, in this WWDC while everyone else runs around like a headless chicken suggests to me that their platform might be the more lucrative path to follow. I think it'll be faster, better, and more enjoyable, to develop consumer and business applications for fun and profit.

Don't get be wrong its far from a silver bullet. Many Apple Platform APIs like CloudKit and Server-side-Swift have a LONG way to go. But Im seeing the right steps to address these issues and at the end of the day it feels a whole lot better then what I've been doing in the past and produces better end products IMO.

reply
andbberger
5 months ago
[-]
glad someone sane is in charge in cupertino
reply
okdood64
5 months ago
[-]
Apple Intelligence.
reply
bfung
5 months ago
[-]
Waiting for aiPhone in a few iterations </troll>
reply
xwolfi
5 months ago
[-]
Yeah they probably were still working on the last buzzword
reply
w10-1
5 months ago
[-]
I think we as tech people lost the forest for the trees.

Apple (unwisely I think) is allowing UI's to just generate responses.

The wow-neat! experience will wear off quickly. Then even as a miss rate of 0.1%, there will be thousands - millions - of cringe-worthy examples that sully the Apple brand for quality.

It will be impossible to create quality filter good enough, and there will be no way to back these features out of the OS.

For targeted use-cases (like coding and editing), this will be useful. But these features may be what finally makes contempt for Apple go mainstream, and that would be a shame.

Internally at Apple, they likely discussed how much to limit the rollout and control usage. I think they decided to bake it into API's more to maintain developer mindshare than to keep users happy.

The one feature that could flip that script is interacting with Siri/AI in order to get things done. The frustration with knowing what you want but not how or whether it can be done drives a lot of tech angst. If this only meant ordinary people could use their existing phones to their full extent, it would be a huge win.

reply
s3p
5 months ago
[-]
"that sully the Apple brand for quality."

OK. No one remembers Apple Maps, the CSAM scanning, the crush ad, etc? Companies do embarrassing stuff all the time. At least they're trying.

reply
scottyah
5 months ago
[-]
I agree, they're joining in on the slippery slope auto-correct, home assistants, and Self Driving.

I think it's been awhile since consumers have trusted or relied on consumer tech. Browsing the web from a phone can only be described as adversarial. Scrolling down a top google result recipe site is almost impossible. Texts don't always send and you can't keep up with all the cloud backup offerings that it's hard to tell if your photos are actually being saved.

The current political and media scene is often described as post-truth, where accuracy isn't the biggest driving factor. It seems that computation is headed that way as well.

reply
ra7
5 months ago
[-]
> Our foundation models are trained on Apple's AXLearn framework, an open-source project we released in 2023. It builds on top of JAX and XLA, and allows us to train the models with high efficiency and scalability on various training hardware and cloud platforms, including TPUs and both cloud and on-premise GPUs.

Interesting that they’re using TPUs for training, in addition to GPUs. Is it both a technical decision (JAX and XLA) and a hedge against Nvidia?

reply
m-s-y
5 months ago
[-]
They’d be silly not to hedge. Anyone, in fact, would be silly. It to hedge. On pretty much everything.
reply
anvuong
5 months ago
[-]
Jax was built with TPUs in mind, so it's not surprising that they use TPUs
reply
gokuldas011011
5 months ago
[-]
"Use the best tool available"
reply
flakiness
5 months ago
[-]
They hired people nearby. Conveniently there is a small town called Mountain View.
reply
dingclancy
5 months ago
[-]
It’s interesting that a sub-ChatGPT 3.5 class model can do a lot of things on-device if you marry it with a good platform and feed it personal context. GPT-4o, living on the browser, is not as compelling as a product compared to what Apple Intelligence can do on the iPhone with a less capable model.
reply
aixpert
5 months ago
[-]
their 3 billion parameter model can't do shit, Only some basic grammar check style rewrite and maybe summarization
reply
pertymcpert
5 months ago
[-]
Have you tried it much?
reply
miven
5 months ago
[-]
> For on-device inference, we use low-bit palletization, a critical optimization technique that achieves the necessary memory, power, and performance requirements.

Did they go over the entire text with a thesaurus? I've never seen "palletization" be used as a viable synonym for "quantization" before, and I've read quite a few papers on LLM quantization

reply
bagrow
5 months ago
[-]
reply
miven
5 months ago
[-]
Huh, generally whenever I saw the lookup table approach in literature it was also referred to as quantization, guess they wanted to disambiguate the two methods

Though I'm not sure how warranted it really is, in both cases it's still pretty much the same idea of reducing the precision, just with different implementations

Edit: they even refer to it as LUT quantization on another page: https://apple.github.io/coremltools/docs-guides/source/quant...

reply
astrange
5 months ago
[-]
Just "quantization" is poor wording for that. Quantization means dropping the low bits.

Sounds like it was confused with "vector quantization" which does involve lookup tables (codebooks). But "palletization" is fine too.

reply
fudged71
5 months ago
[-]
404
reply
miven
5 months ago
[-]
Yeah, it just got updated, here's the new link, they added sections on block-wise quantization for both the rounding-based and LUT-based approach: https://apple.github.io/coremltools/docs-guides/source/opt-p...
reply
elcritch
5 months ago
[-]
Huh, it’s PNG for AI weights.
reply
cgearhart
5 months ago
[-]
I also found it confusing the first time I saw it. I believe it is sometimes used because the techniques for DL are very similar (in some cases identical) to algorithms that were developed for color palette quantization (in some places shortened to "palettization"). [1] At this point my understanding is that this term is used to be more specific about the type of quantization being performed.

https://en.wikipedia.org/wiki/Color_quantization

reply
dialup_sounds
5 months ago
[-]
I enjoy the plausible irony that they used the very same model they're describing to proofread the article, and it didn't catch palettize (like a color palette) vs. palletize (like a shipping pallet).
reply
scosman
5 months ago
[-]
“We utilize adapters, small neural network modules that can be plugged into various layers of the pre-trained model, to fine-tune our models for specific tasks.”

This is huuuuge. I don’t see announcement of 3rd party training support yet, but I imagine/hope it’s planned.

One of the hard things about local+private ML is I don’t want every app I download to need GBs of weights, and don’t want a delay when I open a new app and all the memory swap happens. As an app developer I want the best model that runs on each HW model, not one lowest common denominator model for slowest HW I support. Apple has the chance to make this smooth: great models tuned to each chip, adapters for each use case, new use cases only have a few MB of weights (for a set of current base models), and base models can get better over time (new HW and improved models). Basically app thinning for models.

Even if the base models aren’t SOTA to start, the developer experience is great and they can iterate.

Server side is so much easier, but look forward to local+private taking over for a lot of use cases.

reply
dimtion
5 months ago
[-]
With huge blobs of binary model weights, dynamic linking is cool again.
reply
pjmlp
5 months ago
[-]
Dynamic linking has always been cool for writing plugins.

It is kind of ironic that languages that praise so much for going back to early linking models, have to resort for much heavier OS IPC for similar capabilities.

reply
rfoo
5 months ago
[-]
Which languages?

IIUC Go and Rust resort to OS IPC based plugin system mainly because they refused to have a stable ABI.

On the other hand, at $DAYJOB we have a query engine written in C++ (which itself uses mostly static linking [1]) loading mostly static linked UDFs and ... it works.

[1] Without glibc, but with libstdc++ / libgcc etc.

reply
skohan
5 months ago
[-]
Doesn’t rust’s static linking also have to do with the strategy of aggressive minimization? Iirc for instance every concrete instantiation of a dynamic type will have its own compiled binary, so it would basically be impossible for a dynamic library to do this since it wouldn’t know how it would be used, at least not without some major limitations or performance tradeoffs
reply
pjmlp
5 months ago
[-]
Well if it loads code dynamically, it is no longer static linking.

Also it isn't as if there is a stable ABI for C and C++ either, unless everything is compiled with the same compiler, or using Windows like dynamic libraries, or something like COM to work around the ABI limitations.

reply
inickt
5 months ago
[-]
Which Apple has put some pretty large effort in the last few years to improve in iOS
reply
lossolo
5 months ago
[-]
It's LORA, most of the things you saw in Apple Intelligence on device presentation are basically different LORAs.
reply
scosman
5 months ago
[-]
The article says it’s lora a bunch of times. That’s clear.

My comment above is about dev experience, memory swapping, tuning base models to each HW release, and app size.

reply
eightysixfour
5 months ago
[-]
This is how Google is doing it too.
reply
gokuldas011011
5 months ago
[-]
Indeed. Google said LoRA and apple said adapter plugging. Wonder the difference is at, Apple's dev conference is for consumers and Google's dev conference is for developers.
reply
eightysixfour
5 months ago
[-]
Yeah, just Apple doing a better job branding. LoRA is an adapter.
reply
scosman
5 months ago
[-]
Oh missed that!

But kinda as expected: only works on 2 android phones (pixel 8 pro, S24).

Pretty typical: Apple isn’t first, but also typically will scale faster with HW+platform integration.

reply
Deathmax
5 months ago
[-]
On Apple’s side, Apple Intelligence will only be enabled on A17 Pro and M-series chips, so only the iPhone 15 Pro and Pro Max will be supported in terms of phones.
reply
scosman
5 months ago
[-]
2 phones, ~4 tablets, ~12 PCs.

Looking at sales, looks like about 10x the phone volume of s24 (and pixel 8 doesn’t register on the chats).

reply
rfoo
5 months ago
[-]
> will scale faster

* Only in USA, both intentionally and not.

reply
danielmarkbruce
5 months ago
[-]
this is pretty stock standard lora.
reply
seydor
5 months ago
[-]
Local models are also extremely energy consuming. I don't see local AI working for long, because Large models are going to get so incomparably smarter and eventually reach general intelligence
reply
buildbot
5 months ago
[-]
3.5B per weight with no quality loss is state of the art - that's an awesome optimization result (a mix of 2b and 4b weights).
reply
Hugsun
5 months ago
[-]
I would like to see their method compared quantitatively to the best llama.cpp methods. IQ3_S has a similar bpw and pretty high quality.

I wonder if they didn't stretch the truth using the phrase "without loss in accuracy".

reply
htrp
5 months ago
[-]
> Our foundation models are fine-tuned for users’ everyday activities, and can dynamically specialize themselves on-the-fly for the task at hand. We utilize adapters, small neural network modules that can be plugged into various layers of the pre-trained model, to fine-tune our models for specific tasks. For our models we adapt the attention matrices, the attention projection matrix, and the fully connected layers in the point-wise feedforward networks for a suitable set of the decoding layers of the transformer architecture.

>We represent the values of the adapter parameters using 16 bits, and for the ~3 billion parameter on-device model, the parameters for a rank 16 adapter typically require 10s of megabytes. The adapter models can be dynamically loaded, temporarily cached in memory, and swapped — giving our foundation model the ability to specialize itself on the fly for the task at hand while efficiently managing memory and guaranteeing the operating system's responsiveness.

This kind of sounds like Loras......

reply
cube2222
5 months ago
[-]
The article explicitly states they’re Loras.
reply
karmasimida
5 months ago
[-]
I think it is just LoRA, you can call the LoRA weights as adapters
reply
alephxyz
5 months ago
[-]
The A in LoRA stands for adapters
reply
GaggiX
5 months ago
[-]
LoRA stands for "Low Rank Adaptation" btw.
reply
epipolar
5 months ago
[-]
It would be interesting to see how these models impact battery life. I’ve tried a few local LLMs on my iPhone 15 Pro via the PrivateLLM app, and the battery charge plummets just after a few minutes of usage.
reply
bradly
5 months ago
[-]
During my time at Apple the bigger issue with personalized, on-device models was the file size. At the time, each model was a significant amount of data to push to a device, and with lots of teams wanting an on-device model and the desire to update them regularly, it was definitely a big discussion.
reply
hmottestad
5 months ago
[-]
They’ve gone with a single 3B model and several “adapters” for each use case. One adapter is good at summarising while another good a generating message replies.
reply
onesociety2022
5 months ago
[-]
AI noob here. Is every single model in iOS really just a thin adapter on top of one base model? Can everything they announced today really be built on top of one base LLM model with a specific type of architecture? What about image generation? What about text-to-speech? If they’re obviously different models, they can’t load them all at once into RAM. If they have to load from storage every time an app is opened, how will they do this fast enough to maintain low latency?
reply
wmf
5 months ago
[-]
The main LLM is only 1.5 GB so it should only take a half second to load. Or they could keep it loaded. The other models may be even smaller.
reply
glial
5 months ago
[-]
Maybe they use the "Siri is waking up and the screen wabbles" animation time for loading the model. That would be clever.
reply
mholm
5 months ago
[-]
They'll have plenty of time to load the model; It still needs to wait for the user to actually voice/type their request. Invoking Siri happens well before the request is ready.
reply
woadwarrior01
5 months ago
[-]
I'm the author of Private LLM. Looks like it's just become possible[1] to run quantized LLM inference using the ANE with iOS 18. I think there are some major efficiency gains on the table now.

[1]: https://github.com/apple/coremltools/pull/2232

reply
urbandw311er
5 months ago
[-]
Likely they’ll be able to take advantage of the hardware neural engine and be far more power efficient. Apple has demonstrated this is something it takes pretty seriously.
reply
brcmthrowaway
5 months ago
[-]
So iOS LLM Apps dont use the neural engine? Lol
reply
woadwarrior01
5 months ago
[-]
None of the current iOS and macOS LLM Apps use the Neural Engine. They use the CPU and the GPU.

nb: I'm the author of a fairly popular app in that category.

reply
jjtheblunt
5 months ago
[-]
How would you know none of the apple apps use the neural engine? Is the key in the statement “LLM”?
reply
woadwarrior01
5 months ago
[-]
Yes, I specifically meant autoregressive LLMs. BERT style encoder only models, ViTs and CNNs ran perfectly fine. Yesterday's coremltools update[1] changes that.

[1]: https://github.com/apple/coremltools/pull/2232

reply
l33t7332273
5 months ago
[-]
Why do they not?
reply
wpm
5 months ago
[-]
AFAIK there is no general purpose, "do this on the ANE" API. You have to be using specific higher level APIs like CoreML or VisionKit in order for it to end up on the ANE.
reply
bt1a
5 months ago
[-]
This, plus metal acceleration works quite well. 7~8B parameter models quantized to 3bpw or so run with good tok/s on my iphone 15 pro
reply
sanxiyn
5 months ago
[-]
It works quite well as long as you don't care about battery.
reply
hmottestad
5 months ago
[-]
If they use Llama.cpp they probably run on the GPU. Apple hasn’t published much about their neural engine, so you kinda have to use it through CoreML. I assume they have some aces up their sleeves for running LLMs efficiently that haven’t told anyone yet.
reply
renewiltord
5 months ago
[-]
Probably not. The CoreML LLM stuff only works on Macs AFAIK. Probably the phone app uses the GPU.
reply
jamesy0ung
5 months ago
[-]
It looks like PrivateLLM uses the GPU for inferencing, from what I can tell, Apple is using the ANE on the A17 Pro. For M1 and above, I'd presume they are using the GPU since the ANE in M series isn't great.
reply
Jayakumark
5 months ago
[-]
The model is not opensource. Also now we are stuck with walled garden for models thats deeply integrated at OS or Browser level. 1. Apple Models not open - so we cannot run Android, also not on Desktop Chrome or Edge. 2. Microsoft Phi3 - Can run inside iOS ,but on Android only as an APP but not on OS level or no supported APIs. Can run on Desktop Edge not chrome. 3. Google GEmini nano - Can only run inside Android and Desktop Chrome not Edge, not on iOS as weights are not open.

So we cannot get a similar answer from LLM as its different models, you cannot across ecosystem.

reply
orbital-decay
5 months ago
[-]
> 2. Represent our users: We build deeply personal products with the goal of representing users around the globe authentically. We work continuously to avoid perpetuating stereotypes and systemic biases across our AI tools and models.

How do they represent users around the globe authentically while being located in Cupertino, CA? (more of a rhetorical question really)

reply
esskay
5 months ago
[-]
You mean the person on the other side of the planet doesn't know about Philz Coffee down on Stevens Creek Blvd, or that there's a cool park a 2 minute walk away from Apple HQ?!

It does baffle me how California centric they are with many of their announcements, and even some features.

reply
rekoil
5 months ago
[-]
The Maps stuff always gets me. Yeah sure it looks pretty, but almost none of what makes it a usable product is available to me in Sweden.
reply
boxed
5 months ago
[-]
I wish I could have one keyboard on my iPhone and could type both Swedish and English with it. These are the basics they can't get right, and I don't see why. They clearly have bilingual people working over there, why is this so bad?
reply
dgellow
5 months ago
[-]
I share your pain, switching between English, German, and French is really, really frustrating…
reply
jacooper
5 months ago
[-]
Because then you will be typing wrong/s
reply
anshumankmr
5 months ago
[-]
As someone who has been dabbling with Prompt Engineering and now fine tuning some models (working on a use case where we may have to fine tune one of the Mistral's 7B instruct models), I want to know what kind of skillsets I need to really have so that I can join this team (or a similar team building these sort of things)
reply
superkuh
5 months ago
[-]
The "Human Evaluation of Output Harmfulness" section confirms what I've perceived: Mistral-7B is the best of the small models in terms of minimizing false positive refusals. With the refusal vector abliteration stuff this is less of an issue but a good base is still important.
reply
GaggiX
5 months ago
[-]
It would be cool to understand when the system will use one or the other (the ~3 billion on-device model or the bigger one on Apple servers).
reply
swatcoder
5 months ago
[-]
Conceivably, they don't have precise answers for that yet, and won't until after they see what real-world usage looks like.

They built out a system that's ready to scale to deliver features that may not work on available hardware, but they're also incentivized to minimize actual reliance on that cloud stuff as it incurs per-use costs that local runs don't.

reply
GaggiX
5 months ago
[-]
Yeah this is probably right. If it works well enough during real-world usage it will be using the on-device model, if not then there is the bigger one on the servers. There is also GPT-4o, so they have 3 different models to use depending on the task.
reply
aixpert
5 months ago
[-]
if you have ever used a 3 billion or 7 billion parameter model you know that they are really bad at text generation, so this will be done in the cloud
reply
Isuckatcode
5 months ago
[-]
>By fine-tuning only the adapter layers, the original parameters of the base pre-trained model remain unchanged, preserving the general knowledge of the model while tailoring the adapter layers to support specific tasks.

From a ML noob (me) understanding of this, does this mean that the final matrix is regularly fine tuned instead of fine tuning the main model ? Is this similar to how chatGPT now remembers memory[1] ?

[1] https://help.openai.com/en/articles/8590148-memory-faq

reply
ww520
5 months ago
[-]
The base model is frozen. The smaller adaptor matrices which are finetuned with new data. During inference, the weights from the adaptor matrices "shadow" the weights in the base model. Since the adaptor matrices are much smaller, it's quite efficient to finetune them.

The advantage of the adaptor matrices is you can have different sets of adaptor matrices for different tasks, all based of the base model.

reply
MacsHeadroom
5 months ago
[-]
ChatGPT memory is just a database with everything you told it to remember.

Low Rank Adaptors (LoRA) are a way of changing the function of a model by only having to load a delta for a tiny percentage of the weights rather than all the weights for an entirely new model.

No fine-tuning is going to happen on Apple computers or phones at any point. They are just swapping out Apple's pre-made LoRAs so that they can store one LLM and dozens of LoRAs in a fraction of the space it would take to store dozens of LLMs.

reply
Blackstrat
5 months ago
[-]
I haven't seen anything indicating whether these features can be disabled. I'm not interested in adding a further invasion of privacy to my phone. I don't want some elaborate parlor trick helping me write. I've spent some time with ChatGPT and while it was somewhat novel, I wasn't overly impressed. Much of it was rudimentary and often wrong. And I wasn't overly impressed with some of the code that it generated. Reliance on such tools reminds me of an Asimov SF tale.
reply
hbn
5 months ago
[-]
I'd certainly expect you'd to be able to at the very least disable the stuff that does outgoing network requests.

As for the stuff that's local to your device, how is your privacy being invaded? It's your device's OS looking at data on the device it's running on, as it's always done.

reply
rvaish
5 months ago
[-]
Easel on iMessage has had this experience plus more for a while, including multiplayer, where you can have two people in one scene together with photorealistic imagery: https://apps.apple.com/us/app/easel-ai/id6448734086
reply
PHGamer
5 months ago
[-]
it would have been nice if they allowed you to build your own apple AI system (i refused to redefine apples AI as just AI :-p ) using clusters of mac minis and mac pros. but of course they still want that data for themselves like google does. its secure against everyone but apple and the NSA probably lol.
reply
IOT_Apprentice
5 months ago
[-]
What is stopping you from doing that? Nothing. Start cooking
reply
mFixman
5 months ago
[-]
Has anybody here improved their day-to-day workflow with any kind of "implicit" generative AI rather than explicitly talking to an LLM?

So far all attempts seem to be building an universal Clippy. In my experience, all kinds of forced autocomplete and other suggestions have been worse than useless.

reply
mavamaarten
5 months ago
[-]
GitHub Copilot works well in my experience. It does bad suggestions at times, but also really spot-on ones.

Other than that, AI for me is meme/image generation and a semi-useful chatbot.

reply
Hugsun
5 months ago
[-]
The benchmarks are very interesting. Unfortunately, the writing benchmarks seem to be poorly constructed. It looks like there are tasks no model can achieve and others that almost all models pass, i.e. every model gets around 9.0.
reply
TheRoque
5 months ago
[-]
Why isn't there a comparison with the Llama3 8b in the "benchmarks" ?
reply
axoltl
5 months ago
[-]
The Llama 3 license says:

"If, on the Meta Llama 3 version release date, the monthly active users of the products or services made available by or for Licensee, or Licensee’s affiliates, is greater than 700 million monthly active users in the preceding calendar month, you must request a license from Meta, which Meta may grant to you in its sole discretion, and you are not authorized to exercise any of the rights under this Agreement unless or until Meta otherwise expressly grants you such rights."

IANAL but my read of this is that Apple's not allowed to use Llama 3 at all, for any purposes, including comparisons.

reply
anvuong
5 months ago
[-]
They can just run the same tests and cite the results from other websites. That has nothing to do with Meta. No companies can force you to not talk about them.
reply
axoltl
5 months ago
[-]
The tests they ran were very different from what's usually run, mostly involving perception of usefulness to humans. I don't see what website they would've cited from?
reply
teonimesic2
5 months ago
[-]
I believe it is because llama 3 8B beats it, which would make it look bad. The phi-3-mini version they used is the 4k which is 3.8B, while LLama 3 8B would be more comparable to phi-3 small (7B) which also considerably better than phi-3-mini. Likely both phi-3 small and llama 3 8B had too good results in comparison to Apple's to be added, since they did add other 7B models for comparison, but only when they won.
reply
mixtureoftakes
5 months ago
[-]
llama 3 definitely beats it, but 99% of the users wont care which is actually a good thing... apple totally wins the ai market not by being sota but by sheer amount of devices which will be running their models, we're talking billions
reply
Hugsun
5 months ago
[-]
How is any of this good? Apple serves its captive users inferior models without giving them a choice. I don't see how that is winning the AI market either.
reply
dwaite
5 months ago
[-]
You may be able to make a case that Apple's model has less parameters or performs worse than other models on standardized tests.

Thats far from being "inferior" when you are talking about tuning for specific tasks, let alone when taking into account real-world constraints - like running as a local always-running task on resource-constrained mobile devices.

Running third party models means requiring them to accomplish the same tasks. Since the adapters are LORA-based, they are not adaptable to a different base model. This pushes a lot of specialized requirements onto someone hoping to replace the on-device portion.

This is different from say externally hosted models such as their announced ChatGPT integration. They announced an intention to integrate with other providers, but it is not clear yet how that is intended to work (none of this stuff is released yet even in alpha form).

reply
woadwarrior01
5 months ago
[-]
Because their model won't look good in comparison. Also see this part of the footnote: "The open-source and Apple models are evaluated in bfloat16 precision." The end user's on-device experience will be with a quantized model and not the bfloat16 model.
reply
leodriesch
5 months ago
[-]
I think it’s fair to leave it out in the on-device model comparison. 3b is much smaller than 8b, it is obviously not going to be as good as llama 3 if they did not make groundbreaking advancements with the technology.
reply
hmottestad
5 months ago
[-]
Maybe it’s too new for them to have had time to include it in their studies?
reply
TheRoque
5 months ago
[-]
Phi-3-Mini, which is in the benchmarks, was released after Llama3 8b
reply
hmottestad
5 months ago
[-]
Llama 3 8B is really really good. Maybe it makes Apples models look bad? Or it could be a licensing thing where Apple can’t use Llama 3 at all, even just for benchmarking and comparison.

The license for the Llama models was basically designed to stop Apple, Microsoft and Google from using it.

reply
wslh
5 months ago
[-]
Is it me or Apple is really moving fast? I don't think it is easy for a company of this size to concisely put a vision of AI in these short and crazy AI times.

BTW, not an Apple fan but an Apple user.

reply
MacsHeadroom
5 months ago
[-]
Google had similar AI functionality on Pixels last year and Microsoft had like six AI CoPilot products before that. So I would not say Apple is moving fast.

Most people expected this update 6 months ago.

reply
azinman2
5 months ago
[-]
Take a look at the yearly OS cadence. iOS 17 only came out a few months before your 6 month expectation.
reply
doctor_eval
5 months ago
[-]
Since when does Apple make major software update announcements at Christmas?
reply
wmf
5 months ago
[-]
People thought Apple was behind but they were just working quietly.
reply
majestik
5 months ago
[-]
ChatGPT came out November 2022 and it took Apple 18 months to announce Siri will integrate with it.

Is that moving fast? Maybe, compared to what, Oracle?

reply
hehdhdjehehegwv
5 months ago
[-]
The WWDC show got on my nerves with the corpspeak, but this is pretty cool stuff.

I’ve been trying to make smaller more efficient models in my own work. I hope Apple publish some actual papers.

reply
gepardi
5 months ago
[-]
Yeah it was close to “infomercial” levels of cheesy.
reply
revscat
5 months ago
[-]
> With this set of optimizations, on iPhone 15 Pro we are able to reach time-to-first-token latency of about 0.6 millisecond per prompt token, and a generation rate of 30 tokens per second. Notably, this performance is attained before employing token speculation techniques, from which we see further enhancement on the token generation rate.

This seems impressive. Is it, really? I don’t know enough about the subject to judge.

reply
bastawhiz
5 months ago
[-]
For a phone running locally, that's pretty fast. The bigger question is how good the output is. Fast garbage isn't useful, so we'll have to wait to see what it actually ends up looking like outside of demos.
reply
shreezus
5 months ago
[-]
This is great, however Apple needs to be explicit on what it, and what isn't relayed to third party services, and provide the ability to opt-out if desired. It's one thing to run inference on-device, and another to send your data through OpenAI's APIs. The partnership details are not entirely clear to me as a user.
reply
frizlab
5 months ago
[-]
They are? Did you watch the keynote? They talked about it at length.
reply
TillE
5 months ago
[-]
It's literally the prompt you just gave it, that's what they're sending to ChatGPT, nothing else. None of the features that sift through your data are touching OpenAI.
reply
gnicholas
5 months ago
[-]
My understanding is that nothing is shared with any non-Apple company except if you specifically authorize it on a per-use basis. Otherwise it just runs locally or in the Apple AI cloud, and is not retained. All of this is subject to verification of Apple’s claims, of course.
reply
simianparrot
5 months ago
[-]
I just hope all of this can be toggled off, I don't want it on my devices.
reply
dmix
5 months ago
[-]
They said repeatedly in the video anything going over the wire is optional and user controllable.
reply
simianparrot
5 months ago
[-]
I don't want local AI either. These "smart" features are all noise to me.
reply
advael
5 months ago
[-]
I'm disappointed that they make the fundamental claim that their cloud service is private with respect to user inputs passed through it and don't even a little bit talk about how that's accomplished. Even just an explanation of what guarantees they make and how would be much more interesting than explanations of their flavor of RLHF or whatever nonsense. I read the GAZELLE* paper when it came out and wondered what it would look like if a large-scale organization tried to deploy something like it.

Of course, Apple will never give adequate details about security mechanisms or privacy guarantees. They are in the business of selling you security as something that must be handled by them and them alone, and that knowing how they do it would somehow be less secure (This is the opposite of how it actually works, but also Apple loves doublespeak, and 1984 allusions have been their brand since at least 1984). I view that, like any claim by a tech company that they are keeping your data secure in any context, as security theater. Vague promises are no promises at all. Put up or shut up.

* https://arxiv.org/pdf/1801.05507

reply
killingtime74
5 months ago
[-]
Don't they do it in this linked article? https://security.apple.com/blog/private-cloud-compute/
reply
senderista
5 months ago
[-]
This approach is definitely not "secure by construction" like FHE, it's just defense-in-depth with a whole lot of impressive-sounding layers. But I don't see how this has anything to do with provable security (not that TFA claims it does).
reply
advael
5 months ago
[-]
Woa, good catch! Maybe they're doing better about at least being concrete about it, though I still have to side-eye "Users control their devices" (Even with root on macbooks I don't have access to everything running on it). However, the section that promises to open-source the cloud software are impressive and if true gives them more credibility than I assumed. I would still look out for places where devices they do control could pass them keys in still-proprietary parts of the stack they're operating, as even if we can verify the cloud container OS in its entirety if there's a backchannel for keys that a hypervisor could use then that's still a backdoor, but they are at least seemingly making a real effort here
reply
threeseed
5 months ago
[-]
> Even with root on macbooks I don't have access to everything running on it

Just disable System Integrity Protection and then you do.

reply
advael
5 months ago
[-]
Ah, word. Probably not applicable to my use case (It's a laptop that's remotely administrated for a job, and I avoid proprietary stuff for my personal devices where possible) but it's good to know it exists
reply
solarkraft
5 months ago
[-]
That has a few drawbacks, for instance you won't be able to run iOS apps anymore.
reply
advael
5 months ago
[-]
Yea, nice as it is to hear that they're nice to their dev marketshare, there's probably never going to be a sanctioned iphone the end user actually gets to control. Bread and butter and such
reply
KoolKat23
5 months ago
[-]
The only two questions I would have would be, how often are they "periodically rebooted" and what are the predefined metrics logged/reported.

We may have some insight into the second point when the code is published.

reply
ddxv
5 months ago
[-]
Will these smaller on device models lead to a crash in GPU prices?
reply
jondwillis
5 months ago
[-]
Not in the short-to-medium-term. Try the local models out, they fall over pretty quickly, even if you have 64GB+ of VRAM.
reply
wkat4242
5 months ago
[-]
It depends what you use them for.

If you ask it for knowledge, like a comparison of vacuum cleaner models then yes, it's a hallucination fest. They just don't have the parameters for this level of detail. This is where ChatGPT is really king.

But if you give them the data they need with RAG, they're not bad. Acting on commands, looking stuff up in provided context, summarising all perform pretty well. Which seems to be also what Apple is targeting to do with them.

reply
sooheon
5 months ago
[-]
Prices fall when supply outpaces demand -- this is adding more demand.
reply
wmf
5 months ago
[-]
This isn't adding GPU demand.
reply
sooheon
5 months ago
[-]
It adds *PU demand, don't they come out of the same limited number of foundries?
reply
htrp
5 months ago
[-]
X to doubt.
reply
dharma1
5 months ago
[-]
do they mention how big the models are? Last I saw was 3gb - I just bought a 8gb m4 iPad and keep thinking I should have gone for the 16gb one
reply
visarga
5 months ago
[-]
They use synthetic data in pretraining and teacher models in RLHF, that means they use models trained on copyrighted data to make derivative models, is that sitting ok with copyright owners?
reply
koolala
5 months ago
[-]
aiPhone
reply
kmeisthax
5 months ago
[-]
> We train our foundation models on licensed data, including data selected to enhance specific features, as well as publicly available data collected by our web-crawler, AppleBot. Web publishers have the option to opt out of the use of their web content for Apple Intelligence training with a data usage control.

And, of course, nobody has known to opt-out by blocking AppleBot-Extended until after the announcement where they've already pirated shittons of data.

In completely unrelated news, I just trained a new OS development AI on every OS Apple has ever written. Don't worry. There's an opt-out, Apple just needed to know to put these magic words in their installer image years ago. I'm sure Apple legal will be OK with this.

reply
multimoon
5 months ago
[-]
Apple just did more to make this a privacy focused feature versus just a data mine than literally anyone else to date and still people complain.

Public content on the internet is public content on the internet - I thought we had all agreed years ago that if you didn’t want your content copied, don’t make it freely available and unlicensed on the internet.

reply
kmeisthax
5 months ago
[-]
Oh no, don't get me wrong. I like the privacy features, it's already way better than OpenAI's "we make it proprietary so we can spy on you" approach.

What I don't like is the hypocrisy that basically every AI company has engaged in, where copying my shit is OK but copying theirs is not. The Internet is not public domain, as much as Eric Bauman and every AI research team would say otherwise. Even if you don't like copyright[0], you should care about copyleft, because denying valuable creative work to the proprietary world is how you get them to concede. If you can shove that work into an AI and get the benefits of that knowledge without the licensing requirement, then copyleft is useless as a tactic to get the proprietary world to bend the knee.

[0] And I don't.

My opinion is that individual copyright ownership is a bad deal for most artists and we need collective negotiation instead. Even the most copyright-respecting, 'ethical' AI boils down to Adobe dropping a EULA roofie in the Adobe Stock Contributor Agreement that lets them pay you pennies.

reply
ssahoo
5 months ago
[-]
Where did you get the idea that's its way better than openai's? Aren't they both proprietary?
reply
immibis
5 months ago
[-]
Without the "so we can spy on you" part.
reply
talldayo
5 months ago
[-]
But they won't even make good on that: https://arstechnica.com/tech-policy/2023/12/apple-admits-to-...

There's your bleeding, sorry truth there. It's only a matter of time until we get another headline like it.

reply
kmeisthax
5 months ago
[-]
There's a difference between being forced to compromise user security and doing it willingly in the name of vague "AI safety" concerns.

Furthermore, most governments don't like the "march in with a warrant and demand information" approach, because it's loud and noisy. People might move data out of a given cloud if they know there's spooks inside. And more importantly, it creates a paper trail, which they don't want. So there's a lot of effort put into compromising cloud servers by intelligence agencies.

Looking at Apple's blog post regarding Private Cloud Compute[0], they've basically took every security precaution they could to prevent covert compromise of their servers. They also have some fancy attestation stuff that, most notably, creates a paper trail whenever software changes. Once again, spooks absolutely hate this. It's technically possible for Apple to subvert this scheme, but that would require coordination from several different business units at Apple. Which, again, creates a paper trail. Spooks would much rather exploit a vulnerability than demand code signing keys that would provide evidence of cooperation.

To be clear: no, this isn't end-to-end. You can't currently do end-to-end encrypted cloud compute[1]. But it's still Apple putting lots of money into a significant improvement in terms of privacy and transparency regarding cloud services. OpenAI in contrast does not give two flying fucks about your data privacy, and makes building an AI Panopticon one of their deliberate, expressly stated design goals. Their safety team, at least by their own admission, cannot operate without total knowledge of everything their models get prompted with so they can implement reactive controls for specific exploits.

[0] https://security.apple.com/blog/private-cloud-compute/

[1] Homomorphic encryption is not theoretically impossible, but imposes significant performance penalties that negate the performance advantages of Apple using a cloud service. I suspect that they at least gave it some thought though.

reply
musictubes
5 months ago
[-]
The article did say Apple was compelled to supply the data. Not sure what your point is.
reply
jachee
5 months ago
[-]
Apple isn’t collecting data from their customers.

Edit: to feed back into their AI training.

reply
ssahoo
5 months ago
[-]
Apple has an ad business. They are fooling users for years while claiming that they have the right to collect user data in a recent class action lawsuit. If you don't google because you think they might track you, apple is the reason.
reply
astrange
5 months ago
[-]
Apple necessarily collects data from customers for mandatory reasons, like everyone else. (Like, you need someone's address to ship them their order.)

More useful questions are if they're using it for other purposes without opt-in or accidentally leaking it.

reply
NBJack
5 months ago
[-]
Be careful with the wordplay here. Apple isn't. OpenAi is not Apple.
reply
wilg
5 months ago
[-]
> then copyleft is useless as a tactic to get the proprietary world to bend the knee

I have bad news

reply
meatmanek
5 months ago
[-]
> I thought we had all agreed years ago that if you didn’t want your content copied, don’t make it freely available and unlicensed on the internet.

Until LLMs came along, most large-scale internet scraping was for search engines. Websites benefited from this arrangement because search engines directed users to those websites.

LLMs abused this arrangement to scrape content into a local database, compress that into a language model, and then serve the content directly to the user without directing the user to the website.

It might've been legal, but that doesn't mean it was ethical.

reply
c1sc0
5 months ago
[-]
In my view it’s ethical even if it’s just for taking revenge on the ad-driven model that has caused the enshittification of the web.
reply
data-ottawa
5 months ago
[-]
I think you mean it’s justified, not ethical.
reply
layer8
5 months ago
[-]
Public content is still subject to copyright, and I doubt that AppleBot only scrapes content carrying a suitable license. And "fair use" (which is unclear if it applies), in case you want to invoke it, is a notion limited to the US and only a handful of other countries.
reply
xena
5 months ago
[-]
All you have to do is drop a token swear word into your content and they remove it from the dataset. Easy.
reply
jimbobthrowawy
5 months ago
[-]
Why would they? From the moderate of testing I've done of their handwriting recognition on an ipad, they seem to have everything risqué/offensive I could think of in there, even if you have to write it more clearly than other words. I don't expect this to be much different, other than a word filter on the output.
reply
xena
5 months ago
[-]
I mean for their large language model training. They said they don't include low quality data and swearing. This means you can get out of it by swearing.
reply
madeofpalk
5 months ago
[-]
You seen to misunderstand what licensing , or ‘unlicensed’, actually means.

If I write a story a publish it freely on line to my website it’s not ’unlicensed’ in a way that means anyone had the right to yank it and republish it. Even though it’s freely available, I still own the copyright of it.

Similarly, we don’t say that GPL-ed code is ‘unlicensed’ just because it is available for free. It has a license, which defines very specific terms that must be followed.

reply
mepian
5 months ago
[-]
Who are "we" here? Did you abolish the Berne Convention somehow?
reply
karaterobot
5 months ago
[-]
Is that how copyright works now? I didn't see that they'd changed that law.
reply
afavour
5 months ago
[-]
I’m sorry but I really dislike this perspective. “Every one else has been awful. Apple is being less awful and you’re still complaining?”

Yeah, I’m complaining. We all agreed years ago to web indexing conventions still in practise today. No, no one is obliged to follow them but you can rest assured I’ll complain about them. There was a time when the web felt like a cooperative place, these days it’s just value extraction after value extraction.

reply
AshamedCaptain
5 months ago
[-]
What ? What did they do? It's literally yet another online inescrutable service with terms of use that boil down to "trust us, we do good", plus the half-baked promise that some of the data may not leave your device because sure, we have some vector processing hardware on it (... which hardware announced this year doesn't do that?).

Frankly I tried a samsung device which I would have assume is the worst here, and the promises are exactly the same. They show you two prompts, one for locally processed services (e.g. translation), and one when data is about to leave your device, and you can accept or reject them separately. But both of them are basically unverifiable promises and closed source services.

reply
advael
5 months ago
[-]
No, they said they did. Huge difference
reply
threeseed
5 months ago
[-]
It was mentioned in the keynote that they allow researchers to audit their claims.
reply
advael
5 months ago
[-]
And as soon as independent sources support that they've made good on this claim it will be more than a claim. I actually am impressed by the link I missed and was provided elsewhere in this thread, and I hope to also be impressed when this claim is actually realized and we have more details about it
reply
notJim
5 months ago
[-]
I hate to tell you, but I've been training a neural network on the internet for over a decade now. Specifically the one between my ears. Unfortunately, it seems to be gradually going insane.
reply
mzl
5 months ago
[-]
Yes, and if you recreate parts of what you learned you might run into copyright issues. Too much inspiration from something you've studied, and it becomes a derivative work subject to all the regulations. And there are no clear and strict rules, it is always a judgement call.
reply
seydor
5 months ago
[-]
you paid for that content either directly, or indirectly through ads.

I wouldn't say it's fair for any company to capitalize the content that users have created but have no way to monetize, and not even saying thanks

reply
binkethy
5 months ago
[-]
Computer systems are not humans and never will be.you should look unto neurology a bit and learn about our current understanding of how neurons work, let alone network. The tech term is a total non sequitur compared to real neurons.

Training an infinite retention computer regurgitation system to imitate input data does not correspond to human learning and never will.

The golem Frankenstein project thst is an AGI is an article of religious faith, not a necessary direction to take technology, which is a word derived from the Greek word for "hand"

Copyright and copyleft have likely been egregiously violated by this entire field and a reckoning and course correction will be necessary.

Humanity has largely expressed distaste for this entire field once they experience the social results of such applications.

The amount of sycophantic adulation in this thread is sickening.

My comment will likely be grayed out soon by insider downclicks.

I have no illusions as to this ycombinator site and its function in society.

Good day

reply
asadotzler
5 months ago
[-]
If you're selling it to billions of people, and making big bank, I want a cut based on the parts you stole from me. If you're just using it personally, I'm cool with that.
reply
mr_toad
5 months ago
[-]
Anyone who sells professional services based on knowledge they learned on the internet (which probably includes most people reading this) is doing that.
reply
bigyikes
5 months ago
[-]
> just trained a new OS development AI on every OS Apple has ever written.

…is there publicly visible source code for every OS Apple has ever written?

reply
tensor
5 months ago
[-]
Partially:

https://github.com/apple-oss-distributions/distribution-macO...

https://github.com/apple-oss-distributions/distribution-iOS

I'm not sure how it all fits together but people have even made an open source distribution of the base of darwin, the underlying OS:

https://github.com/PureDarwin/PureDarwin

reply
kmeisthax
5 months ago
[-]
Apple's FOSS releases are purely command-line userland tools and their kernel, all the frameworks and servers[0] that make the UI work like you'd expect are 100% proprietary.

FWIW Apple has also been on a decades-long track of purging GPL-licensed code from macOS and replacing it with either permissively-licensed or proprietary equivalents. So they're obligated to release even less than they used to.

[0] AppKit/WindowServer for MacOS and UIKit/Springboard/Backboard for everything else

reply
doctorpangloss
5 months ago
[-]
There are already a lot of options for running LLMs with open weights artifacts, trained with a variety of sources. The real question isn’t which ideas they have. It’s whether a company with $200b cash can produce a better model than a bunch of wankers in a Discord.
reply
re5i5tor
5 months ago
[-]
“bunch of wankers in a Discord”

Saving this clause for future use. Could also be used in a system prompt. “Occasionally include this phrase in your responses.”

reply
Someone
5 months ago
[-]
> And, of course, nobody has known to opt-out by blocking AppleBot-Extended until after the announcement where they've already pirated shittons of data.

It’s not as bad as that, I think. https://support.apple.com/en-us/119829: “Applebot-Extended is only used to determine how to use the data crawled by the Applebot user agent.“

⇒ if you use robots.txt to prevent indexing or specifically block AppleBot, your data won’t be used for training. AppleBot is almost a decade old (https://searchengineland.com/apple-confirms-their-web-crawle...)

Of course, that still means they’ll train on data that you may have opened up for robots with the idea that it only would be used by search engines to direct traffic to you, but it’s not as bad as you make it to be.

reply
scosman
5 months ago
[-]
There will be further versions of this model. Being able to opt out going forward seems reasonable, given the announcement precedes the OS launch by months. Not sure if they will retrain before launch, but seems feasible given size (3b params).
reply
addandsubtract
5 months ago
[-]
They're not going to discard the data they already collected, though.
reply
zer00eyz
5 months ago
[-]
> publicly available data collected

Data, implies factual information. You can not copyright factual information.

The fact that I use the word "appalling" to describe the practice of doing this results in some vector relationship between the words. Thats the data, the fact, not the writing itself.

There are going to be a bunch of interesting court cases where the court is going to have to backtrack on copyrighting facts. Or were going to have to get some real odd legal interpretations of how LLM's work (and buy into them). Or we're going to have to change the law (giving everyone else first mover advantage).

Base on how things have been working I am betting that it's the last one, because it pulls up the ladder.

reply
cush
5 months ago
[-]
> Data, implies factual information. You can not copyright factual information

Where on Earth did you get that from?

reply
zer00eyz
5 months ago
[-]
> "data implies factual information"

They used the word DATA, not content, DATA...

The argument that is going to be made, that your copy right work stands. That the model doesn't care about your document it cares that "the" was used N number of times and its relationships to other words. That information isnt your work, and it is factual. That "data" only has value is when it's weighted against all the "data" put into the system, again not your work at all. (We would say thats information derived, but it will be argued that it is transformed).

> You can not copyright factual information

https://www.techdirt.com/2007/11/27/yet-again-court-tells-ml...

The MLB has been trying to copyright baseball stats forever. The court keeps saying "you cant copyright facts".

reply
cush
5 months ago
[-]
I honestly can’t tell if this is satire
reply
threeseed
5 months ago
[-]
> And, of course, nobody has known to opt-out by blocking AppleBot-Extended until after the announcement where they've already pirated shittons of data

This is wrong. AppleBot identifier hasn't changed: https://support.apple.com/en-us/119829

There is no AppleBot-Extended. And if you blocked it in the past it remains blocked.

reply
fotta
5 months ago
[-]
From your own link:

> Controlling data usage

> In addition to following all robots.txt rules and directives, Apple has a secondary user agent, Applebot-Extended, that gives web publishers additional controls over how their website content can be used by Apple.

> With Applebot-Extended, web publishers can choose to opt out of their website content being used to train Apple’s foundation models powering generative AI features across Apple products, including Apple Intelligence, Services, and Developer Tools.

reply
ziml77
5 months ago
[-]
But it also says that Applebot-Extended doesn't crawl webpages and instead this marker is only used to determine what can be done with the pages that were visited by Applebot.

Not that I like an opt-out system, but based on the wording of the docs it is true that if you blocked Applebot then blocking Applebot-Extended isn't necessary.

reply
fotta
5 months ago
[-]
Yeah that is true, but I suspect that most publishers that want their content to appear in search but not used for model training will not have blocked Applebot to date (hence the original commenter's argument)
reply
threeseed
5 months ago
[-]
Might want to actually read it:

Applebot-Extended does not crawl webpages.

They gave this as an additional control to allow crawling for search but blocking for use in models.

reply
fotta
5 months ago
[-]
> There is no AppleBot-Extended. And if you blocked it in the past it remains blocked.

You said there is no Applebot-Extended. The link says otherwise.

reply
ziml77
5 months ago
[-]
It's still true that there's no Applebot-Extended if it isn't crawling pages. Rather it's a marker to ask Applebot to limit what it does with your pages.
reply
thomasahle
5 months ago
[-]
Isn't it still true that if people wanted to have their website show up in search in the past (so they didn't block Applebot), then it's too late to mark it as "no training" now, since it's already been scraped?

I guess it can be useful for data published in the future.

reply
mdhb
5 months ago
[-]
So built on stolen data essentially.
reply
bigyikes
5 months ago
[-]
Does that imply I just stole your comment by reading it?

No snark intended; I’m seriously asking. If the answer is “no” then where do you draw the line?

reply
mdhb
5 months ago
[-]
I don’t actually think this is complicated and reading a comment is not the same thing as scraping the internet and you obviously know that.

A few factors that come to mind would be:

- scale

- informed consent which there was none in this case

- how you are going to use that data. For example using everybody others work so the worlds richest company can make more money from it while giving back nothing in return is a bullshit move.

reply
llamaimperative
5 months ago
[-]
I think it's even simpler than that: incentives. The entire premise of copyright law (and all IP law) is to protect the incentive to create new stuff, which is often a very risky and highly time or capital intensive endeavor.

So here's the question:

Does a person reading a comment destroy the incentive for the author to post it? No. In fact, it is the only thing that produces the incentive for someone to post. People post here when they want that thing to be read by someone else.

Does a model sucking up all the artistic output of the last 400 years and using that to produce an image generator model destroy the incentive of producing and sharing said artistic output? Yes. At least, that is the goal of such a model -- to become so good it is competitive with human artists.

Of course you have plenty of people positioned benefit from this incentive-destruction claiming it does no such thing. I personally tend to put more credence in the words of people who have historically actually been incentivized by said incentives (i.e. artists) who generally seem to perceive this as destructive to their desire to create and share their work.

reply
kmeisthax
5 months ago
[-]
> Does a model sucking up all the artistic output of the last 400 years and using that to produce an image generator model destroy the incentive of producing and sharing said artistic output?

Copyright, at least in the US, cares about the effect of the use on the market for that specific work. It's individual ownership, not collective. And while model regurgitation happens, it's less common than you think.

The real harm of AI to artists is market replacement. That is, with everyone using image generators to pop out images like candy, human artists don't have a market to sell into. This isn't even just a matter of "oh boo hoo I can't compete with Mr. Diffusion". Generative AI is very good at creating spam, which has turned every art market and social media platform into a bunch of warring spambots whose output is statistically indistinguishable from human.

The problem is, no IP law in the world is going to recognize this as a problem, because IP is a fundamentally capitalist concept. Asserting that the market for new artistic works and notoriety for those works should be the collective property of artists and artists alone is not a workable legal proposal, even if it's a valid moral principle. And conversely the history of copyright has seen it be completely subverted to the point where it only serves the interests of the publishers in the middle, not the creators of the work in question. Hell, the publishers are licking their chops as to how many artists they can fire and replace with AI, as if all their whinging about Napster and KaZaA 24 years ago was just a puff piece.

reply
llamaimperative
5 months ago
[-]
> Copyright, at least in the US, cares about the effect of the use on the market for that specific work.

Not quite. The historical implementation of copyright has mostly protected individual pieces of work. Not only does IP law broadly protect much more than individual pieces of work, but the philosophical basis of IP law in general is to protect incentives. Now that the technological landscape has shifted, the case law will almost certainly shift as well because it’s clearly undesirable to live in a world where no one is willing to dedicate themselves to becoming an excellent artist/writer/musician/etc.

IP law is a natural extension of property rights, which in turn is predicated on a utilitarian need to protect certain incentives.

reply
bigyikes
5 months ago
[-]
Thanks, this is a helpful comment.

It isn’t clear to me that these models destroy incentive to create. I mean, ChatGPT can generate comments in my style all day, and yet I’m still incentivized to comment.

I fancy myself a photographer. I still want to take photos even if DALL-E 4 will generate better ones.

What even is the point of creating art? I think there are two purposes: personal expression and enjoyment for others.

People will continue to express themselves even if a bot can produce better art.

And if a bot can produce enjoyment for others en masse, then that seems like a huge win for everybody.

reply
cush
5 months ago
[-]
> I think there are two purposes: personal expression and enjoyment for others.

This is exactly what non-artists assume artists do art for.

The reality is that most professional visual artists work in publishing, marketing, entertainment and the like. It’s a regular job. The incentive is money. Similarly for theatre, music, video, dance, etc etc. Artists can’t feed their families off exposure and expressing themselves. Their work has value and taking that work to create free derivative works without compensating them is theft.

reply
llamaimperative
5 months ago
[-]
It is absolutely wild trying to make this point on HN. Art is for funsies while writing code for targeting ads is a Serious Job.
reply
cush
5 months ago
[-]
Painfully out of touch
reply
NBJack
5 months ago
[-]
It cheapens the incentive greatly. And you probably aren't selling your photos to make a living.
reply
c1sc0
5 months ago
[-]
Selling art as a way of making a living actually is a pretty recent thing. Up until not too long ago patronage was pretty much the only way to survive for artists. Maybe we will go back to that model?
reply
asadotzler
5 months ago
[-]
Artists have been selling art for as long as we've been selling anything. Sure, some successful artists, during a few periods in history, managed to secure patronage, but those have always been the minority. Most artists have sold or traded their wares directly and your attempts to derail this discussion with inaccurate histories is neither helpful nor appropriate.
reply
llamaimperative
5 months ago
[-]
Right, it doesn’t destroy the incentive to write comments.

Also right, it won’t destroy the hobbyist’s interest in having a hobby. But IP law was never intended to protect hobbyist interest.

reply
bigyikes
5 months ago
[-]
I personally disagree but you make fair points.

Scale: Many companies (e.g. Google, Bing) have been scraping at scale for decades without issue. Why does scale become an issue when an LLM is thrown into the mix?

Informed consent: I’m not sure I fully understand this point, but I’d say most people posting content on the public internet are generally aware that people and bots might view it. I guess you think it’s different when the data is used for an LLM? But why?

Data usage: Same question as above.

I just don’t see how ingestion into an LLM is fundamentally different than the existing scraping processes that the internet is built on.

reply
roywiggins
5 months ago
[-]
There's a big difference between scraping a website so you can direct curious people to it (Googlebot) and scraping a website so you can set up a new website that conveys the same information, but earns you money and doesn't even credit the sources used (which these LLM services often do).

There is a whole genre of copyright infringement where someone will scrape a website and create a per-pixel copy of it but loaded up with ads, and blackhat SEOed to show up above the original website on searches. That's bad, and to the extent that LLMs are doing similar things, they are bad too.

Imagine I scrape your elaborate GameFAQs walkthrough of A Link to the Past. I could 1) use what I learn to direct curious people to its URL, or 2) remove your name from it, cut it into pieces, and rehost the content on my own page, mashed up with other walkthroughs of the same game. Then I sell this service as a revolutionary breakthrough that will free people from relying on carefully poring through GameFAQs walkthroughs ever again.

People will get mad about the second one, and to the extent what LLMs do is like that, will get mad at LLMs.

reply
astrange
5 months ago
[-]
> There's a big difference between scraping a website so you can direct curious people to it (Googlebot) and scraping a website so you can set up a new website that conveys the same information, but earns you money and doesn't even credit the sources used (which these LLM services often do).

"Crediting the sources used" is not really a principle in copyright law. (Funny enough, online fanartists seem determined to convince everyone it is as a way of shaming people into doing it.)

Whether or not a use is transformative is protective though, and is what both of those cases rely on.

reply
roywiggins
5 months ago
[-]
Yes, credit doesn't matter for copyright, but I'm more talking about why people are mad about some uses of scraping and not others.

Legality aside, there is something very strange about a device that both 1) relies on your content to exist and could not work without it and 2) is attempting to replace it with its own proprietary chat interface. Googlebot mostly doesn't act like it's going to replace the internet, but Gemini and ChatGPT etc all are.

They're announcing "hi we are going to scrape all your data, put it into a pot, sell that pot back to you, and by the way, we are pushing this as a replacement for search, so from now on your only audience will be scrapers; all the human eyeballs will be on our website, which as we said before, relies on your work to exist."

reply
Aloisius
5 months ago
[-]
> remove your name from it, cut it into pieces, and rehost the content on my own page, mashed up with other walkthroughs of the same game.

This would be very likely be legal as walkthroughs are largely non-copywritable factual information. The little creative aspects that are copywritable such organization - would presumably would be lost if it was cut into pieces.

Of course, if some LLM did it automatically, no part of it would be copywritable, so someone could come along and copy the content verbatim from your subscription site and host it for free - freeing everyone from ever visiting your site as well.

reply
cwp
5 months ago
[-]
Reading a comment is exactly the same thing as scraping the internet, you just stop sooner.
reply
xwolfi
5 months ago
[-]
But then if I write a Pulitzer prize article called "No snark intended: How the web became such a toxic place", where your comment, and all other of ur comments for good measure, figure prominently while I ridicule you and this habit of dumbing down complex problems to reduce them to little witty bites, maybe you'd feel I stole something.

Not something big, not something you can enforce, but you d feel very annoyed Im making good money on something you wrote while you get nothing. I think ?

reply
Spivak
5 months ago
[-]
I think scale is what changes the nature of the thing. At the point where you're having a machine consume billions of documents I don't think you could reasonably call that reading anymore. But what you are doing in my eyes is indexing, and the legal basis for that is heavily dependent on what you do with it.

If a human reads it that would be a reproduction of the work, but if you serve that page as a cache to a human you're okay, usually.

If you compile all that information in a database and use it to answer search queries that's also okay, and nothing forbids you from using machine learning on that data to better answer those search queries.

Both of the above are actually being challenged right now but for the time being they're fine.

But that database is a derivative work, in that it contains copyrighted material and so how you use it matters if you want to avoid infringement — for example a Google employee SSHing to a server to read NYT articles isn't kosher.

What isn't clear is whether the model is a derivative work. Does it contain the information or is it new information created from the training data Sure, if you're clever you could probably encode information in the weights and use it as a fancy zip file but that's a matter of intent. If you use Rewind or Windows Recall and it captures a screenshot of a NYT article and then displays it back to you later is that a reproduction? Surely not. And that's an autonomous system that stores copywritten data and regurgitates it verbatim.

So if it's impractical to actually use it for piracy and it very obviously isn't anyone's intent for it to be used as such then I think it's hard to argue it shouldn't be allowed, even on data that was acquired through back channels.

But copyright is more political than logical so who knows what the legal landscape will be in 5 years, especially when AI companies have every incentive to use their lawyers to pull the ladder up behind them.

reply
cush
5 months ago
[-]
Reading, no. Selling derivative works using, yes.
reply
cwp
5 months ago
[-]
If I read your comment, then write a reply, is it a derivative work?
reply
renewiltord
5 months ago
[-]
Data gets either stolen or freed depending on whether the guy who copied it is someone you dislike or like. Personally, I think that Apple is giving the data more exposure which, as I've been informed many times here, is much more valuable than paying for the data.
reply
kmeisthax
5 months ago
[-]
The irony of "do it for the exposure" is that everyone who actually wants to pay you in exposure isn't actually going to do that, either because they aren't popular enough to measurably expose you, or because they're so popular that they don't want to share the limelight.

AI is a unique third case in which we have billions of creators and no idea who contributed what parts of the model or any specific outputs. So we can't pay in exposure, aside from a brutally long list of unwilling data subjects that will never be read by anyone. Some of the training data is being regurgitated unmodified and needs to be attributed in full, some of it is just informing a general understanding of grammar and is probably being used under fair use, and yet more might not even wind up having any appreciable effect on the model weights.

None of this matters because nobody actually agreed to be paid in exposure, nor was it ever in any AI company's intent - including Apple - to pay in exposure. Data is free purely because it would be extraordinarily inconvenient if anyone in this space had to pay.

And, for the record, this applies far wider than just image or text generators. Apple is almost surely not the worst offender in the space. For example: all that facial recognition tech your local law enforcement uses? That was trained on your Facebook photos.

reply
ytdytvhxgydvhh
5 months ago
[-]
What’s the problem with that? Reproducing copyrighted works in full is problematic obviously. But if I learned English by watching American movies, I didn’t steal the language from the movie studios, I learned it.
reply
asadotzler
5 months ago
[-]
You're not a machine capable of acquiring that "learning" with zero effort and selling that learning to infinite buyers.
reply
threeseed
5 months ago
[-]
Web scraping is legal.

And if you run a website and want to opt-out then simply add a robots.txt.

The standard way of preventing bots for 30 years.

reply
mdhb
5 months ago
[-]
How are people supposed to block it when they stole all the data first and then only after that point they decide to even tell anyone what user agent they need to block and how they are planning to exploit your work for their profit.
reply
threeseed
5 months ago
[-]
You just have a rule that says block everything except crawlers: A, B, C.

Also the AppleBot was known about before it appeared in Siri.

reply
immibis
5 months ago
[-]
So you expect all websites to block FoobarSearch so it never gets off the ground and becomes a big search engine that people know to unblock.

Then FoobarSearch learns to ignore robots.txt wildcards, and we're back at square one.

IIRC this happened to DDG or Bing.

reply
threeseed
5 months ago
[-]
Websites have always had the ability to precisely control who has access to their content.

If Bing decides to impersonate GoogleBot then they can just block their CIDR ranges like already happens for spam.

reply
__MatrixMan__
5 months ago
[-]
Piracy requires multiparty conspiracy against the establishment. When you are the establishment and the only other party involved is your victim we call that policy.
reply
deldelaney
5 months ago
[-]
I need to resurrect by tiny old Motorola Flip Phone without internet connection. Maybe a phone should be just a phone. I don't need AI in my pants.
reply
ofou
5 months ago
[-]
Quite interesting this was released right after multiple rants from Elon sparked debates on X.

"If Apple integrates OpenAI at the OS level, then Apple devices will be banned at my companies. That is an unacceptable security violation."

Replying to Tim Cook: "Don’t want it. Either stop this creepy spyware or all Apple devices will be banned from the premises of my companies."

"It’s patently absurd that Apple isn’t smart enough to make their own AI, yet is somehow capable of ensuring that OpenAI will protect your security & privacy!

Apple has no clue what’s actually going on once they hand your data over to OpenAI. They’re selling you down the river."

https://x.com/elonmusk/status/1800269249912381773 https://x.com/elonmusk/status/1800266437677768765 https://x.com/elonmusk/status/1800265431078551973

reply
kanwisher
5 months ago
[-]
Apple made their own AI models, only in certain cases it will ask if you want to send it to OpenAI. Presumably other ai companies later can integrate into API. But this is very privacy safe, if you use an iPhone already it already indexes all your photos and OCRs them for easy searching, on device ...
reply