Hacker News new | ask | show | jobs
by talldayo 743 days ago
Almost makes you wonder if they're inferencing models on that hardware but not training it there. They worded that part of the presentation very carefully.

Which also begs the question; how much money would Apple save by using Nvidia for everything? Probably not much since they don't have to pay margins on Apple Silicon bought for themselves. But I suspect there is a literal monetary cost to bruteforcing an Nvidia-scale server network with weaker hardware.

1 comments

Yeah I’d guess inference, training relies much more on high speed interconnect between nodes, which I’m sure they could do but it’s certainly another step up in complexity.