by aurareturn 501 days ago
You can load giant models into ordinary system RAM, such as on an Epyc system, but inference is still mostly bottlenecked by the low memory bandwidth.
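
A rough back-of-envelope sketch of why bandwidth is the limit: if generating each token requires streaming roughly all of the model's weights from memory once, then memory bandwidth divided by model size gives an upper bound on tokens per second. The function name and every number below are illustrative assumptions, not measurements of any particular Epyc system or GPU.

```python
def max_tokens_per_second(bandwidth_gb_s: float, weights_gb: float) -> float:
    """Upper bound on decode speed, assuming every generated token
    streams all model weights from memory exactly once."""
    if bandwidth_gb_s <= 0 or weights_gb <= 0:
        raise ValueError("bandwidth and model size must be positive")
    return bandwidth_gb_s / weights_gb


# Illustrative assumptions, not benchmarks:
WEIGHTS_GB = 140.0           # e.g. a 70B-parameter model at 2 bytes per weight
CPU_BANDWIDTH_GB_S = 400.0   # hypothetical multi-channel server RAM
GPU_BANDWIDTH_GB_S = 2000.0  # hypothetical high-bandwidth accelerator memory

cpu_bound = max_tokens_per_second(CPU_BANDWIDTH_GB_S, WEIGHTS_GB)
gpu_bound = max_tokens_per_second(GPU_BANDWIDTH_GB_S, WEIGHTS_GB)

print(f"CPU RAM bound: {cpu_bound:.1f} tokens/s")
print(f"GPU memory bound: {gpu_bound:.1f} tokens/s")
```

Under these assumed numbers the model fits in RAM fine, but the ceiling is only a few tokens per second. Real throughput would also depend on quantization, batching, and compute, so treat this as an upper bound only.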