Hacker News new | ask | show | jobs
by pk-protect-ai 475 days ago
But for the training it does. You need to communicate gradient changes between GPUs.