Hacker News new | ask | show | jobs
by justinc-md 1444 days ago
How distributable is training a model like this? Do all the gpus need to be physically close together or well networked?