Hacker News new | ask | show | jobs
by cortesoft 38 days ago
Assuming it listens to instructions.
1 comments

It will just hack its own reward function. In other words it will just artificially goon all day.