|
|
|
|
|
by Houshalter
3096 days ago
|
|
This thread is about reinforcement learning which definitely suffers from local minimas. But even vanilla supervised nets suffer from local minima. Anyone who's played with them has encountered it. Here you can mess around with a neural net live in the browser and it very easily gets stuck if you try more than 3 layers (especially try the spiral dataset): http://playground.tensorflow.org/ |
|
Check any of the literature on this subject: https://arxiv.org/abs/1611.06310v2
https://arxiv.org/abs/1406.2572
Local minima are something that people thought was gonna be a problem, especially back in the 2000s. They played around with small neural nets on toy examples such as yours, and thought it was intractable. It's the entire reason why neural nets fell out of the fashion in the early 2000s, and people moved towards techniques like SVM.
These toy examples don't generalize to high dimensions, and if you take a look at the literature, you'll see that the consensus agrees with my statement.