Hacker News
by
turnsout
907 days ago
What is RL?
2 comments
godelski
907 days ago
Reinforcement Learning. They are referencing a concept known as reward hacking (see Robert Miles's videos for a high-level explanation). You may already be familiar with the concept, though: see Goodhart's Law.
blharr
907 days ago
Reinforcement learning