Hacker News new | ask | show | jobs
by cratermoon 509 days ago
Is there not a survey paper on RLHF equivalent to the "A Survey on Large Language Model based Autonomous Agents" paper? Someone should get on that.
1 comments

*

1 point by _giorgio_ 0 minutes ago | next | edit | delete [–]

https://arxiv.org/abs/2412.05265

Reinforcement Learning: An Overview Kevin Murphy

    This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs). 
From: Kevin Murphy [view email] [v1] Fri, 6 Dec 2024 18:53:49 UTC (6,099 KB)