|
|
|
|
|
by fragmede
317 days ago
|
|
The paper itself is fairly popular, with several thousand citations. Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean https://arxiv.org/abs/1701.06538 |
|