|
|
|
|
|
by mordae
1203 days ago
|
|
Isn't gradient descent basically PID over parameters?
And tricks like momentum basically a low-pass filter integrated in the PID loop? It's quite weird how not that many concepts from analog electronics domain have gotten carried over to ML. |
|