Hacker News
by Tenoke 2322 days ago
This is the code for the distributed training library, not the model: https://github.com/microsoft/DeepSpeed/