by stigsb 2895 days ago
If training requires N operations, using more GPUs (that would otherwise sit idle) simply means you finish (and iterate) faster.
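To put rough numbers on that, here is a minimal sketch. It assumes perfectly linear scaling across GPUs (no communication or synchronization overhead), which real training runs only approximate. The function names and the figures (1e18 operations, 1e13 operations per second per GPU) are made up for illustration.

```python
def wall_clock_seconds(total_ops: float, ops_per_gpu_per_sec: float, num_gpus: int) -> float:
    """Idealized time to finish a fixed amount of work.

    Assumption: throughput scales linearly with the number of GPUs.
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return total_ops / (ops_per_gpu_per_sec * num_gpus)


def gpu_seconds(total_ops: float, ops_per_gpu_per_sec: float, num_gpus: int) -> float:
    """Total GPU time consumed: wall-clock time multiplied by GPU count."""
    return wall_clock_seconds(total_ops, ops_per_gpu_per_sec, num_gpus) * num_gpus


if __name__ == "__main__":
    N = 1e18      # hypothetical total operations for one training run
    RATE = 1e13   # hypothetical sustained operations per second per GPU
    for gpus in (1, 8, 64):
        hours = wall_clock_seconds(N, RATE, gpus) / 3600
        total_gpu_hours = gpu_seconds(N, RATE, gpus) / 3600
        print(f"{gpus:>3} GPUs: {hours:8.2f} h wall clock, {total_gpu_hours:8.2f} GPU-hours")
```

Under that assumption the GPU-hours column stays constant while wall-clock time shrinks, so the same N operations get done sooner and you can iterate more often.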