Hacker News new | ask | show | jobs
by stephenroller 1919 days ago
Support for this was also added to [Fairscale](https://fairscale.readthedocs.io/en/latest/) and [Fairseq](https://github.com/pytorch/fairseq) last week. In particular, the Fairscale implementation can be used in any pyotrch project without requiring the use of the Deepspeed trainer.
1 comments

What are the relevant commits in Fairseq for this? I couldn't figure out the changes by looking at the commits from last week.