by buildbot 744 days ago
I think the most important and somewhat massive news is that Apple built an entire LLM private cloud using Apple Silicon - that's a big deal/investment.
2 comments

Almost makes you wonder if they're running inference on that hardware but not training models there. They worded that part of the presentation very carefully.

Which also raises the question: how much money would Apple save by using Nvidia for everything? Probably not much, since they don't have to pay margins on Apple Silicon bought for themselves. But I suspect there is a literal monetary cost to brute-forcing an Nvidia-scale server network with weaker hardware.

Yeah, I'd guess inference. Training relies much more on high-speed interconnect between nodes, which I'm sure they could do, but it's certainly another step up in complexity.
I noticed Tim said "LLM and diffusion models". Are diffusion models just the image generation stuff?
Yeah, diffusion models are typically used for image gen.