Hacker News new | ask | show | jobs
by samarth0211 68 days ago
This is a fantastic step toward democratizing large model training. Making 100B+ parameter training accessible on a single GPU could open the door to a lot more independent research. Really impressive work!