Y
Hacker News
new
|
ask
|
show
|
jobs
by
rescripting
1548 days ago
But what is the “maximum possible reward”? Does a limit exist? Or is it now consuming all possible resources to develop storage and compute resources to grow that limit…
3 comments
janto
1548 days ago
I imagine a paperclip factory with trucks driving in loops in front of a scanner that is over counting them as they drive past.
link
ganzuul
1548 days ago
Deleting the reward function ends the game.
link
kkjjkgjjgg
1548 days ago
It could also change the way its reward function is being computed.
link