Hacker News new | ask | show | jobs
by gugagore 240 days ago
This is the first time I see "SGD" to mean "standard gradient descent" and not "stochastic gradient descent".
2 comments

Presumably that's just a mistake. The author calls it "stochastic gradient descent" correctly elsewhere in the article
haha oops yeah the other comment is correct- that was just a mistake

I originally wrote "vanilla" there but didn't want to repeat that word twice in a row so swapped it for "standard" without realizing it now looked like the SGD acronym

just fixed that to avoid confusion- thanks for pointing it out!