Hacker News new | ask | show | jobs
by daviddumenil 2882 days ago
I think it was more a case that Google had a large existing investment in CPU-based compute.

A lot of Dean's initial work was getting models effectively training on that kind of hardware using distributed SGD.