Y
Hacker News
new
|
ask
|
show
|
jobs
by
astrange
47 days ago
That shouldn't happen as long as the autoencoder isn't used as an RL reward. It will happen (due to Goodhart's law) if it is.
Of course, if you use it to make any decision that can still happen eventually.