|
|
|
|
|
by skj
3418 days ago
|
|
It's not clear to me how this is interestingly different from model-based RL, where you learn the state function and reward function, and then use various types of simulation to learn a value function. I guess I'll have to read more than the abstract... |
|
I haven't seen SR before in the RL literature, but the paper argues that this representation is useful for sub-goal identification. I guess I'll have to read more than the abstract as well :)