Y
Hacker News
new
|
ask
|
show
|
jobs
by
zargon
373 days ago
> Don't the parallelizing techniques of a 4x build make using them more difficult than a 1x build with no extra parallelism?
For inference, no. For training, only slightly.