by int_19h 830 days ago
They do, but for inference at least, it's memory bandwidth that is the primary limiting factor for home LLMs right now, not raw compute.
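
A rough back-of-envelope sketch of why bandwidth dominates: when generating one token at a time, each token needs roughly one full pass over the model weights, so memory bandwidth divided by model size gives a ceiling on tokens per second. The function name and the numbers below are illustrative assumptions, not measurements of any particular hardware, and the estimate ignores KV cache traffic, batching, and compute overhead.

```python
def decode_tokens_per_second_ceiling(bandwidth_gb_s, params_billions, bytes_per_param):
    """Rough upper bound on single-stream decode speed.

    Assumes every generated token reads all model weights from memory once,
    and that nothing else (compute, KV cache, overhead) limits throughput.
    """
    model_size_gb = params_billions * bytes_per_param  # weights only
    return bandwidth_gb_s / model_size_gb


# Hypothetical example: a 7B-parameter model quantized to ~4 bits (0.5 bytes/param)
# on two made-up memory systems, one at 100 GB/s and one at 800 GB/s.
slow = decode_tokens_per_second_ceiling(100, 7, 0.5)
fast = decode_tokens_per_second_ceiling(800, 7, 0.5)
print(f"100 GB/s ceiling: {slow:.1f} tokens/s")
print(f"800 GB/s ceiling: {fast:.1f} tokens/s")
```

Under these assumptions the ceiling scales linearly with bandwidth and does not depend on compute throughput at all, which is the point being made above.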
1 comment

Wonder if the Apple silicon Ultra series will start using HBM3(e) on desktop in the future.