Hacker News new | ask | show | jobs
by rescripting 1548 days ago
But what is the “maximum possible reward”? Does a limit exist? Or is it now consuming all possible resources to develop storage and compute resources to grow that limit…
3 comments

I imagine a paperclip factory with trucks driving in loops in front of a scanner that is over counting them as they drive past.
Deleting the reward function ends the game.
It could also change the way its reward function is being computed.