Hacker News new | ask | show | jobs
by peter303 1032 days ago
The larger language models now employ a trillion parameters. This is faster when memory and computing is tighter, not distributed. Cerebus's million core super-wafer addresses this.