by Eridrus 2807 days ago
Optimization as a tool is important and widely used, but... almost everything grabbing headlines uses some form of SGD + Momentum. Very little of the actual progress comes from better optimization.
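For anyone who hasn't seen it written out, the "SGD + Momentum" update is roughly the sketch below. The function name, parameter names, and toy quadratic are made up for illustration. The gradient here is exact, so strictly speaking this is gradient descent with momentum; in real SGD the gradient would be estimated from a minibatch.

```python
def sgd_momentum(grad_fn, x0, lr=0.1, momentum=0.9, steps=500):
    """Minimize a 1-D function given its gradient (hypothetical helper)."""
    x = float(x0)
    v = 0.0  # velocity: exponentially decaying sum of past gradient steps
    for _ in range(steps):
        # Classic momentum: keep a fraction of the old velocity,
        # then step against the current gradient.
        v = momentum * v - lr * grad_fn(x)
        x = x + v
    return x


# Toy problem (an assumption for this example): f(x) = (x - 3)^2,
# whose gradient is 2 * (x - 3) and whose minimum is at x = 3.
x_min = sgd_momentum(lambda x: 2.0 * (x - 3.0), x0=0.0)
```

The whole method is those two lines in the loop, which is sort of the point: the optimizer itself is rarely where the novelty is.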