Y
Hacker News
new
|
ask
|
show
|
jobs
by
kaesve
835 days ago
There's
https://arxiv.org/abs/2310.00166
, which uses an LLM for intrinsic rewards for a RL agent. They use it on nethack. It was discussed on the TalkRL podcast:
https://www.talkrl.com/episodes/pierluca-doro-and-martin-kli...
1 comments
bubblyworld
834 days ago
Ooh, really cool! Thanks for the links.
link