Hacker News
by cjbprime, 64 days ago
Inference (not training) is bottlenecked by memory access speed, not compute. Having special hardware wouldn't make it faster unless you somehow found a faster memory controller than the GPU has.
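
To put rough numbers on that, here is a back-of-envelope sketch. It assumes single-stream decoding where every weight is read from memory once per generated token, and it ignores KV-cache and activation traffic. The function names and the hardware figures (7B parameters, fp16, 1 TB/s bandwidth, 100 TFLOP/s peak) are illustrative assumptions, not the specs of any particular GPU or model.

```python
def max_tokens_per_second(param_count, bytes_per_param, mem_bandwidth_bytes_per_s):
    """Upper bound on decode speed if every weight is read once per token."""
    model_bytes = param_count * bytes_per_param
    return mem_bandwidth_bytes_per_s / model_bytes


def arithmetic_intensity(batch_size, bytes_per_param):
    """FLOPs performed per byte of weights read.

    Assumes about 2 FLOPs (one multiply, one add) per parameter per token,
    with the weights read once and reused across the whole batch.
    """
    return 2 * batch_size / bytes_per_param


def is_memory_bound(batch_size, bytes_per_param, peak_flops, mem_bandwidth_bytes_per_s):
    """True when the workload sits below the hardware's FLOPs-per-byte ridge point."""
    ridge_point = peak_flops / mem_bandwidth_bytes_per_s
    return arithmetic_intensity(batch_size, bytes_per_param) < ridge_point


if __name__ == "__main__":
    # Hypothetical numbers: 7e9 parameters, 2 bytes each (fp16), 1e12 bytes/s bandwidth.
    print(max_tokens_per_second(7e9, 2, 1e12))
    # Hypothetical accelerator: 100e12 FLOP/s peak, 1e12 bytes/s bandwidth.
    print(is_memory_bound(1, 2, 100e12, 1e12))
    print(is_memory_bound(256, 2, 100e12, 1e12))
```

With these made-up figures the ceiling is roughly 71 tokens/s no matter how much compute is available, and batch size 1 does about 1 FLOP per byte against a ridge point of 100. Under this simple model the workload only stops being memory-bound once the batch is large enough to reuse each weight many times.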