|
|
|
|
|
by WithinReason
618 days ago
|
|
If you spent some time actually training networks you know that's not true, that's why batch norm, dropout, regularization is so successful. They don't increase the network's capacity (parameter count) but they increase its ability to learn. |
|