by gwern
1497 days ago
They use ALE '51' instead of 57, so I assume not. (Montezuma's Revenge is almost purely about exploration, and given demonstrations, training a successful agent wouldn't be hard, so there's not much benefit to training on it here. Gato would probably get a good score, but no one would care. The hard-exploration games in ALE are often left out for that reason.)

Last I checked, the only team that has shown good performance on that game is Uber, and from what I recall they used a controversial hack that would be unlikely to generalize to other environments.