|
|
|
|
|
by dgb23
2002 days ago
|
|
Asking from a layman's perspective: I've read a bit about genetic algorithms or evolutionary computation at some point. Apparently it achieves good results as it can find discrete solutions for complex, well defined problems. Reinforcement learning is something I know even less about. But from what I gathered it is also most successful in well defined problems and systems (such as games). So my question is: How do they relate? Is there overlap and what are the most significant conceptual differences? |
|
well known methods like Q-learning are basically just iterative, approximate methods to find solutions to the Bellman equation — i.e. a measure of value for every state of the world, such that the Bellman equation is satisfied.
policy optimization methods don’t do this, but there are still mathematical connections back to the Bellman equation (there is a duality relationship between value functions and policies).
I would say this focus is a big part of what makes the field of RL unique.