by throwaway5486nv
1589 days ago
My understanding is that the configuration of the neurons matters more than their density. The reason is that in some cases neural networks trained with dropout perform better than fully connected neural networks trained without it. This suggests that less dense networks can be more intelligent.
Case in point: https://openai.com/blog/block-sparse-gpu-kernels/
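
For anyone unfamiliar with dropout, here is a rough sketch of the idea in plain Python. It uses the common "inverted dropout" variant, and the `dropout` function and the `layer_output` values are made up for illustration; they are not taken from the linked post. Also note that dropout only thins the network during training, while the full network is normally used at inference time.

```python
import random


def dropout(activations, p=0.5, rng=None):
    """Inverted dropout (illustrative sketch, not from the linked post).

    Each activation is zeroed with probability p; survivors are scaled
    by 1 / (1 - p) so the expected value of each unit is unchanged.
    """
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    rng = rng or random.Random()
    scale = 1.0 / (1.0 - p)
    return [a * scale if rng.random() >= p else 0.0 for a in activations]


# Hypothetical activations from one hidden layer.
layer_output = [0.2, 1.5, 0.7, 0.9, 1.1, 0.4]

# On each training step a different random subset of units is silenced,
# so the network effectively trains many thinner sub-networks.
thinned = dropout(layer_output, p=0.5, rng=random.Random(0))
print(thinned)
```

Each call keeps a different random subset of units, which is the sense in which a dropout network is "less dense" while it is being trained.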