Hacker News new | ask | show | jobs
by kaesve 835 days ago
There's https://arxiv.org/abs/2310.00166, which uses an LLM for intrinsic rewards for a RL agent. They use it on nethack. It was discussed on the TalkRL podcast: https://www.talkrl.com/episodes/pierluca-doro-and-martin-kli...
1 comments

Ooh, really cool! Thanks for the links.