Hacker News new | ask | show | jobs
by fragmede 586 days ago
Training a trillion parameter model.