But I really do question how well Windows on Arm is really going to work out long term.
For Apple it worked because they were able to force the issue. If you wanted a new Mac it was going to be Arm and we all knew eventually (this year or is it next year?) Intel support would drop. Over time we have seen M series exclusive features.
Developers were forced to update or abandon Mac which gave users a great experience (with some early growing pains).
This is something that Windows will never be able to do. They will always be stuck maintaining an emulator, and a likely large subset of apps will only support one architecture or the other. (Also, does this work the other way around, with an Arm-only app running on x86?)
This seems like a repeat of when it was not uncommon for games to only support Intel or AMD, or NVIDIA or AMD. But worse, since they are not both x86. Sure, at least we have emulation, but just like with Rosetta 2 it shouldn't ever be the long-term solution.
They only know Apple, Windows and Chromebooks.
I expect we'll get there in a few years, so perhaps this is Nvidia taking an early step in that direction.
In that case, this goes against Anthropic and OpenAI's business models. Which is a double whammy after Jensen Huang's recent comment about how agentic coding will only increase demand for software engineers, not reduce it.
So it also feels like a part of a budding shift in the competitive tension between the various parts of the AI supply chain.
One reason it works surprisingly well on modern systems is how much is offloaded to the GPU. You aren't going to get great power optimization or anything without it being truly native though.
There are games which are CPU limited though, and it will be interesting to see how those do. Curiously those also tend to be in engines with Arm support already.
What would be interesting to me would be how quickly developers start targeting ARM64 directly.
No one seriously cares about this running Windows. We want Steam and CUDA/Ollama, and Windows just gets in the way. nVidia are simply not that oblivious, but I have to admit in their position I'd have considered the Microsoft involvement more trouble than it's worth, which is among the many reasons I'm not a billionaire.
Maybe they think the RAM market is so terrible it will kill the whole initiative regardless.
Has Steam finally started to push for native Linux games instead of translating Windows ones?
So with proprietary blobs that give you more trouble than they're worth?
Looking at devices like the NVIDIA Shield gives me some hope that NVIDIA will be better than Qualcomm here. I just hope this is not a case where the OEM has to purchase X years of driver support from the chip vendor beforehand, and that NVIDIA will provide support directly itself.
Of course, DGX Spark is a miniPC, so laptops will likely be slower due to power limits/throttling.
Around 2-3K USD something with a good GPU + CPU + 128GB of integrated RAM is just going to be an awesome experience.
Considering Mac options are north of 5K+ even on a regular day.
"Introducing the NVIDIA RTX Spark™ Superchip. The fusion of NVIDIA AI and RTX graphics in a single chip redefines Windows PCs and delivers amazing creating, AI development, and gaming—on the slimmest, most beautiful RTX laptops ever and small, ultra-efficient desktops."
Nvidia is also very, very rich and pushes the boundaries of stuff. They stopped waiting for industry standards. You can see this in their network stuff. All Nvidia.
The next logical step (at least now, not something I thought about before) was their own CPU for their GPU racks/clusters/systems.
Now they have everything anyway, RTX Spark is just logical.
I don't think it's specifically targeted at Apple at all.
Apple has like 10-15% market share and just because some IT nerds buy themselves a mac mini doesn't mean much.
Plenty of them actually just run openclaw without local models. Something which surprised me quite a lot.
But I have two 4090s at home. They consume a lot of power, and I had to research the proper mainboard model and mod one 4090 to use water cooling because they run too hot.
Their Spark setup was at 3k, way too expensive for normal people. If they can get this down and sell more, great for their ecosystem (strengthening it) and for getting more money from people.
It does surprise me though that they have enough capacity for this chip and aren't just putting everything into Rubin, but perhaps the build-out has slowed down a little or they are starting to diversify already for economic safety.
+ Windows
+ Screen
- ConnectX-7 Smart NIC
Can the link type be toggled between Ethernet and Infiniband? (Don't think I've ever heard of a laptop with IB.)
Physically, NVIDIA did the GPU chiplet and Mediatek did the other chiplet that has the CPU, DRAM controller, and IO.
https://www.bhphotovideo.com/c/product/1957120-REG/apple_mbp...
$3649 with 128GB of ram
Bosgame M5 AI Mini Desktop Ryzen AI Max+ 395 96GB variant €1.800,95 (sold out)
128GB+2TB variant €2.401,95 (in stock)
I have the latter, it's fantastic
- 5090/6000 Pro: 1792GB/s
- 5080: 960GB/s
- 5070Ti: 892GB/s
- M3 Ultra: 819GB/s
- DGX Spark: 273GB/s (less than an M5 Pro at 307GB/s)
Memory bandwidth isn't everything but it will cap inference rate pretty heavily. Also, the M3 Ultra is for an almost 2 year old Mac Studio. It's widely expected that it'll be refreshed in Q3 with a likely M5 or M4 Ultra with >1000GB/s. I really hope Apple realizes what a market opportunity it has here.
The above shows just how good value the 5090 really is. It basically is a stripped-down RTX 6000 Pro, which is a ~$10k card, for 20-30% of the price. This also demonstrates how NVidia uses VRAM for market segmentation. As an aside, the true data center cards (eg B100, H100) use HBM memory at ~3.2TB/s.
[1]: https://wccftech.com/nvidia-enters-pc-space-with-rtx-spark/
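A rough way to see how bandwidth caps inference: when generating one token at a time, a dense model has to stream roughly all of its weights from memory for every token, so tokens/s can't exceed bandwidth divided by model size. The sketch below applies that to the figures listed above. The 35 GB model size (roughly a 70B-parameter model at 4 bits per weight) is an assumption picked for illustration, not a number from this thread, and real throughput will be lower once compute, KV cache reads, and other overhead are counted.

```python
# Back-of-envelope ceiling on single-stream decode speed for a dense model.
# Assumption: every generated token requires reading all weights once,
# so tokens/s <= memory bandwidth / model size.

# Bandwidth figures (GB/s) as quoted in the list above.
BANDWIDTH_GB_S = {
    "5090/6000 Pro": 1792,
    "5080": 960,
    "5070Ti": 892,
    "M3 Ultra": 819,
    "M5 Pro": 307,
    "DGX Spark": 273,
}

# Hypothetical model: ~70B parameters at 4 bits per weight, about 35 GB.
MODEL_SIZE_GB = 35.0


def max_tokens_per_second(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on tokens/s when decode is purely bandwidth-bound."""
    if model_size_gb <= 0:
        raise ValueError("model_size_gb must be positive")
    return bandwidth_gb_s / model_size_gb


if __name__ == "__main__":
    for name, bw in BANDWIDTH_GB_S.items():
        ceiling = max_tokens_per_second(bw, MODEL_SIZE_GB)
        print(f"{name:>14}: <= {ceiling:5.1f} tok/s")
```

Under those assumptions the DGX Spark tops out just under 8 tok/s and a 5090-class card around 51 tok/s. The bound says nothing about capacity, though: a 35 GB model would not fit in a 5090's 32 GB of VRAM, which is the other half of the trade-off.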
All I care about is if I can get one of these for significantly less than a dgx and get Linux on it for some cuda Blackwell kerneling.
What does AMD or Intel have here?
I'm not sure if I like this. Sure for a laptop this might be not a big problem but if this ARM ecosystem is a success it will spread to desktop computers and I fear we could lose the existing modularity.
But yes, it tends to be soldered on.
I don't think so.
This will most likely be a winmodem situation, again.
I think more announcements will follow soon from other companies.
Nvidia really threw stuff over the wall with the DGX Spark release. They don't seem to really care. I sort of think they'll spend a little more time on Windows, where there's no pesky upstreaming to do and they can just do whatever, but man, it's such typical hubris from Nvidia to build such an expensive box with good chips but make it basically unsupportable and roasty hot all the time.
You also generally have to run an ever more stale, two-year-old, Ubuntu-derived DGX OS to get anywhere, bespoke kernel and drivers and all. None of it is well supported, none of it just works like a comparable PC or even a well-behaved Arm system would.
As for other ARM, there were rumors AMD Sound Wave is/was going to be a ~10W arm APU, but there hasn't been much said about it lately. Honestly given the ram crunch, it's maybe just not worth trying to build a system with a cheap core, if the rest of your costs are going to stay so stratospheric. https://www.techpowerup.com/341848/amd-sound-wave-arm-powere...
A powerful new chapter for Windows PCs, accelerated by Nvidia RTX Spark
https://news.ycombinator.com/item?id=48352693
Surface Laptop Ultra: Made for World Makers
Benchmarks with the DGX aren't spectacular given NVIDIA's software and CUDA lead.
Wouldn't count on this being a price/compute challenger, especially with overpriced VRAM.
All those CUDA cores in the sparks but they're starved for memory bandwidth.
I am still waiting for NVidia to release a system that legit beats 3090 maxxing for the home gamer...
Spark:
OS: Windows/Ubuntu
Mbw: 300GB/s
Cuda cores: 6000
GPU accelerated containers: yes
M5 max:
OS: macOS
Mbw: 600GB/s
Cuda cores: 0
GPU accelerated containers: no

The Sparks are good if your ultimate plan is to spend even more on NVidia hardware in future to run your dev setups at usable speeds. Or, you're developing for a work cluster.
If you mainly want to run local models at acceptable speeds portably, buy a mac with lots of RAM. If you’re happy with non-portable / racked, buy 3090s (dense) or mac studios (MoEs). Buy newer cards if you are restricted on power or slots. If you are rich, buy a6000 blackwells.
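The dense vs MoE split here comes down to how many bytes get read per token: a mixture-of-experts model only touches its active experts on each token, so it needs lots of memory capacity but far less bandwidth per token than a dense model of the same total size. A minimal sketch using the ~300GB/s and ~600GB/s figures from the comparison above; both model sizes are hypothetical numbers chosen for illustration, and the bound ignores compute, KV cache, and routing overhead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    total_gb: float   # weights that must fit in memory
    active_gb: float  # weights actually read per generated token


def decode_ceiling(bandwidth_gb_s: float, model: Model) -> float:
    """Bandwidth-bound upper limit on tokens/s (active weights read once per token)."""
    return bandwidth_gb_s / model.active_gb


# Hypothetical models with the same ~60 GB of weights in total.
# The MoE is assumed to activate about a tenth of them per token.
DENSE = Model("dense", total_gb=60.0, active_gb=60.0)
MOE = Model("MoE", total_gb=60.0, active_gb=6.0)

# Memory bandwidth (GB/s) as quoted in the comparison above.
SYSTEMS = {"Spark": 300.0, "M5 Max": 600.0}

if __name__ == "__main__":
    for system, bw in SYSTEMS.items():
        for model in (DENSE, MOE):
            ceiling = decode_ceiling(bw, model)
            print(f"{system:>7} {model.name:>5}: <= {ceiling:6.1f} tok/s")
```

With these made-up sizes the dense model is stuck at single-digit tok/s on either machine, while the MoE stays usable, which is roughly why big unified-memory boxes get recommended for MoEs and high-bandwidth discrete cards for dense models.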
Also I heard the tensor core instructions on the dgx are gimped and you’re better off with a rtx pro x000. Is that the same with these machines?
And is it really a way to lock in people? With AI coding tools, isn’t it trivial to write software on top of CUDA and rewrite it to target some other hardware?
no.
NVIDIA and Microsoft Reinvent Windows PCs for the Age of Personal AI
https://news.ycombinator.com/item?id=48352705
NVIDIA DGX Station for Windows Puts a Trillion-Parameter AI Supercomputer on Every Enterprise Desk
https://news.ycombinator.com/item?id=48352691
Introducing Surface Laptop Ultra: Made for world makers
https://news.ycombinator.com/item?id=48352627
Introducing a powerful new chapter for Windows PCs, accelerated by NVIDIA RTX Spark
Sure the graphics capabilities are probably very good. But if you’re a game developer who has traditionally built on Windows on x86 chips, would you want to invest in this new chip or invest in making games for the Apple ecosystem? Aren’t there more new customers to reach in the Apple world than this new Nvidia world?
Windows and the new chip. Higher developer productivity and higher chances of a substantial audience.
I think they make a great "second device" where you have something meatier to fall back to if something doesn't quite work right. I'm not sure if it's ready to take on the "main device" role just yet. But it's a far far better experience than the Surface RT days.
I was disappointed to see that the RTX Spark has the ARM cores from the DGX Spark. I was hoping it had their new in-house developed cores that Nvidia is starting to use on their latest gen server parts. They look really fast. That said, if RTX Spark has CPU performance like the DGX Spark, it will be almost as fast as the top AMD/Intel parts.