| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kaesve 835 days ago
	There's https://arxiv.org/abs/2310.00166, which uses an LLM for intrinsic rewards for a RL agent. They use it on nethack. It was discussed on the TalkRL podcast: https://www.talkrl.com/episodes/pierluca-doro-and-martin-kli...

1 comments

Ooh, really cool! Thanks for the links.