Hacker News new | ask | show | jobs
by fragmede 317 days ago
The paper itself is fairly popular, with several thousand citations.

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

https://arxiv.org/abs/1701.06538