Hacker News new | ask | show | jobs
by zargon 373 days ago
> Don't the parallelizing techniques of a 4x build make using them more difficult than a 1x build with no extra parallelism?

For inference, no. For training, only slightly.