Hacker News new | ask | show | jobs
by maxerize 3611 days ago
As the above poster said, if you can introduce people to a topic by explaining a simplified or 'naive' solution/algorithm, then that could be a good springboard to learn more about the topic. Which is why there should be more comments presenting improvements and alternatives, rather than criticising what is obviously meant as a primer on optimisation. From the intro:

Gradient descent is a standard tool for optimizing complex functions iteratively within a computer program. Its goal is: given some arbitrary function, find a minima.

For some small subset of functions - those that are convex - there's just a single minima which also happens to be global. For most realistic functions, there may be many minima, so most minima are local.

Making sure the optimization finds the "best" minima and doesn't get stuck in sub-optimial minima is out of the scope of this article.

Here we'll just be dealing with the core gradient descent algorithm for finding some minima from a given starting point.

1 comments

The way the author uses the word "minima" as if it was both singular and plural is a really discouraging sign.