Hacker News new | ask | show | jobs
by mordae 1203 days ago
Isn't gradient descent basically PID over parameters? And tricks like momentum basically a low-pass filter integrated in the PID loop? It's quite weird how not that many concepts from analog electronics domain have gotten carried over to ML.