Y
Hacker News
new
|
ask
|
show
|
jobs
by
Dylan16807
811 days ago
As long as the GPU-local memory can hold a couple layers at a time, I don't think the latency to the currently-inactive layers matters very much, only the bandwidth.