Hacker News new | ask | show | jobs
by MSFT_Edging 634 days ago
Datacenters aren't running M-series chips.
2 comments

Well not M-series chips specifically, but chips optimized for these kind of workloads (like the neural engine in M-series chips is).
IIRC The M series chip isn’t specifically optimized for ML workloads, the biggest gain it has is having unified video and cpu memory as transferring layers between the two is a big bottleneck on non Apple systems.

Real ML hardware (like the Nvidia H1000s) that can handle the kind of inference traffic you see in production get hot and use quite a bit of energy, especially when they run at full blast 24/7

Google’s TPU energy usage is a well-kept secret / competitive advantage. If energy efficiency isn’t a major concern for them, I bet it will be in a couple years.