Hacker News new | ask | show | jobs
by partykid92 3509 days ago
quasi newton methods are not square in the dimension of the problem (think limited memory L-BFGS), and can be run in linear time. In my experience, however, they're 2-3 times slower than regular methods like ADAM.