Hacker News new | ask | show | jobs
by wnoise 2447 days ago
Yes, saddle points are far more common than local minima in high dimension. Unfortunately they're really good at slowing down naive gradient descent...