Hacker News new | ask | show | jobs
by sdenton4 59 days ago
The randomness (and exploration) encouraged by batch training also helps avoid 'real' minima, if they exist.