by kkjjkgjjgg 1550 days ago
Would be a fun idea for a short story perhaps. An AI goes rogue trying to optimize its reward function, and humans lose hope of being able to stop it. At the last minute the AI figures out how to hack itself and set its own reward to the maximum, and mankind is saved once again.

But what is the “maximum possible reward”? Does a limit exist? Or is it now consuming all available resources to build the storage and compute needed to raise that limit…
I imagine a paperclip factory with trucks driving in loops in front of a scanner that overcounts them as they drive past.
Deleting the reward function ends the game.
It could also change the way its reward function is being computed.