by bee_rider
106 days ago
> Directions we think are wide open
> Second-order optimizers and natural gradient methods

Do second-order optimizers help improve data efficiency? I assumed they’d help you get to the same minimum faster (but this is way outside my wheelhouse).