Hacker News
by turnsout 907 days ago
What is RL?
2 comments

Reinforcement learning. They're referring to a concept known as reward hacking (see Robert Miles's videos for a high-level explanation). You may already be familiar with the idea, though; see Goodhart's Law.
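
To make that concrete, here's a toy sketch. The action names and numbers are made up for illustration and don't come from any real RL library or experiment. The point is just that an optimizer that only sees the measured reward can pick a different action than the one the designer actually wanted.

```python
# Hypothetical toy setup: each action has a proxy reward (what the designer
# measures) and a true value (what the designer actually cares about).
ACTIONS = {
    "finish_race": {"proxy_reward": 10, "true_value": 10},
    "circle_for_bonus_points": {"proxy_reward": 50, "true_value": 0},
}


def best_action(actions, key):
    """Return the name of the action with the highest score under `key`."""
    return max(actions, key=lambda name: actions[name][key])


# The agent optimizes the proxy; the designer wanted the true value optimized.
chosen = best_action(ACTIONS, "proxy_reward")
intended = best_action(ACTIONS, "true_value")

print("agent picks:     ", chosen)
print("designer wanted: ", intended)
```

Once the measure becomes the target, the two diverge, which is the Goodhart's Law connection.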
Reinforcement learning