Y
Hacker News
new
|
ask
|
show
|
jobs
by
justinc-md
1791 days ago
This is exactly the kind of behavior that you'd see with reward hacking in reinforcement learning.
1 comments
grp000
1791 days ago
I think humans are pretty good at finding the optimum strategy. It's only really recently that bots have overtaken us in a lot of highly environmentally complex games.
link