|
|
|
|
|
by DanteVertigo
1478 days ago
|
|
"and investigate how much of the usual machinery of reinforcement learning algorithms can be replaced with the tools..." These sounds as if the authors have confidently figured out that the current reinforcement learning formulation is not good enough. On the other hand, I think the recent large language models have showed us that much of the world knowledge is indeed predictive. That, if you can predict accurately (next words), you can understand higher more abstract things. The hypothesis that much of world knowledge is predictive, is very important in the framework of reinforcement learning because that means that with enough General Value functions learned off-policy, one can predict almost anything about the world that is useful to the agent in achieving its goals. (cf the Horde paper). |
|
What do you mean with "understand"? And why are you calling "knowledge" that which is predictive?