Y
Hacker News
new
|
ask
|
show
|
jobs
by
Alifatisk
162 days ago
> By keeping computation and memory on a single wafer-scale processor, we eliminate the data-movement penalties that dominate GPU systems. The result is up to 15× faster inference, without sacrificing model size or accuracy.
https://xcancel.com/andrewdfeldman/status/201154226777402186...