| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by justinc-md 1791 days ago
	This is exactly the kind of behavior that you'd see with reward hacking in reinforcement learning.

1 comments

grp000 1791 days ago

I think humans are pretty good at finding the optimum strategy. It's only really recently that bots have overtaken us in a lot of highly environmentally complex games.

link