Much of Dean's early work went into getting models to train effectively on that kind of hardware using distributed SGD.