| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by axel180 2740 days ago

Ty for asking this question! "Reinforcement learning" has a wide definition, but in this case because we are using a simple feed forward neural network, "reinforce" is more principled by dynamic programming with supervised learning. We are not actually using a "Deep reinforcement learning" algorithm.

The idea here is that the net can continue to train, reinforcing its previous understanding with new understandings, if new training data is provided.

It may be that we need to clarify the tutorial, as your point is based around unsupervised learning, not having training data.

Is it an overloaded term? YES!

1 comments

Rotten194 2737 days ago

Thanks for the clarification!

link