by axel180
2740 days ago
Thanks for asking this question! "Reinforcement learning" has a broad definition, but in this case, because we are using a simple feed-forward neural network, "reinforce" is based more on the principles of dynamic programming with supervised learning. We are not actually using a "deep reinforcement learning" algorithm. The idea here is that the net can continue to train, reinforcing its previous understanding with new understanding, whenever new training data is provided. We may need to clarify the tutorial, since your point is about unsupervised learning, that is, learning without labeled training data. Is it an overloaded term? YES!
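To make the "continue to train" idea concrete, here is a minimal sketch in plain Python. It is not the tutorial's code: `TinyNet`, its methods, and the toy data are all hypothetical, and the "network" is reduced to a single sigmoid unit so the example stays short. The only point is that a second call to `train` starts from the weights learned so far rather than from scratch.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


class TinyNet:
    """Hypothetical minimal feed-forward model: one sigmoid unit, two inputs."""

    def __init__(self):
        self.w = [0.0, 0.0]
        self.b = 0.0

    def predict(self, x):
        return sigmoid(self.w[0] * x[0] + self.w[1] * x[1] + self.b)

    def loss(self, data):
        # Mean cross-entropy over labeled (input, target) pairs.
        eps = 1e-12
        total = 0.0
        for x, y in data:
            p = min(max(self.predict(x), eps), 1.0 - eps)
            total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
        return total / len(data)

    def train(self, data, epochs=200, lr=0.5):
        # Full-batch gradient descent. It updates the existing weights,
        # so calling it again continues from what was already learned.
        n = len(data)
        for _ in range(epochs):
            gw = [0.0, 0.0]
            gb = 0.0
            for x, y in data:
                err = self.predict(x) - y
                gw[0] += err * x[0]
                gw[1] += err * x[1]
                gb += err
            self.w[0] -= lr * gw[0] / n
            self.w[1] -= lr * gw[1] / n
            self.b -= lr * gb / n
        return self.loss(data)


# Assumed toy data: the first batch, then newly provided labeled examples.
initial_data = [((0, 0), 0), ((1, 1), 1)]
new_data = [((0, 1), 1), ((1, 0), 1)]

net = TinyNet()
net.train(initial_data)             # first round of supervised training

combined = initial_data + new_data
loss_before = net.loss(combined)    # how well the old weights fit everything
net.train(combined)                 # keep training the same net on more data
loss_after = net.loss(combined)
```

Note that every example here carries a label, which is why this is supervised learning and not reinforcement learning in the reward-driven sense.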