I think the most important and somewhat massive news is that Apple built an entire LLM private cloud using Apple Silicon - that's a big deal/investment.
Almost makes you wonder if they're inferencing models on that hardware but not training it there. They worded that part of the presentation very carefully.
Which also begs the question; how much money would Apple save by using Nvidia for everything? Probably not much since they don't have to pay margins on Apple Silicon bought for themselves. But I suspect there is a literal monetary cost to bruteforcing an Nvidia-scale server network with weaker hardware.
Yeah I’d guess inference, training relies much more on high speed interconnect between nodes, which I’m sure they could do but it’s certainly another step up in complexity.
Which also begs the question; how much money would Apple save by using Nvidia for everything? Probably not much since they don't have to pay margins on Apple Silicon bought for themselves. But I suspect there is a literal monetary cost to bruteforcing an Nvidia-scale server network with weaker hardware.