Hacker News
by twohearted 2394 days ago
I assume this has been tried, but what happens if you give MuZero a goal like "keep the system/process that spawns me running as long as possible"?
2 comments

Why do you assume this has been tried? It's not even clear what the game is. In this setting, what state and actions would the algorithm have access to?
In some games it could find a cycle, for example by moving back and forth between the same positions, that keeps the game going indefinitely. That won't work in a game like Go, though, where the ko and superko rules forbid repeating a position[1].

1: https://en.wikipedia.org/wiki/Rules_of_Go#Ko_and_Superko
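
To make the back-and-forth point concrete, here's a rough sketch of a made-up toy game (not Go, and nothing to do with how MuZero is actually implemented). A token sits on a 1-D board and the player's only goal is to keep making moves. The function name `longest_game`, the board, and the `forbid_repeats` flag are all assumptions for illustration; the flag stands in for a superko-style "no repeated positions" rule.

```python
def longest_game(num_cells: int, start: int, forbid_repeats: bool,
                 max_steps: int = 1000) -> int:
    """Count the moves a 'just keep playing' player makes in a toy game.

    Hypothetical game: a token sits on one of `num_cells` cells and each
    move shifts it one cell left or right. With `forbid_repeats=True`, a
    move to an already-visited cell is illegal (a superko-like rule).
    """
    position = start
    seen = {position}   # every position reached so far
    direction = 1       # keep going the same way until blocked
    moves = 0

    while moves < max_steps:
        # Prefer continuing in the current direction, otherwise turn around.
        candidates = [position + direction, position - direction]
        legal = [
            p for p in candidates
            if 0 <= p < num_cells and not (forbid_repeats and p in seen)
        ]
        if not legal:
            break  # no legal move left, so the game ends
        nxt = legal[0]
        direction = 1 if nxt > position else -1
        position = nxt
        seen.add(position)
        moves += 1

    return moves


# Without the repetition rule the player bounces back and forth until the
# step cap; with it, the game is bounded by the number of distinct cells.
unbounded = longest_game(5, 2, forbid_repeats=False)
bounded = longest_game(5, 2, forbid_repeats=True)
```

Without the rule the only thing that stops the loop is the arbitrary `max_steps` cap. With it, each move has to reach a new cell, so the game can never last more than `num_cells - 1` moves.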