Hacker News new | ask | show | jobs
by singhrac 1928 days ago
For those searching, DeepSpeed is implemented as a set of C++/CUDA extensions on top of PyTorch (compiled using their JIT).