|
|
|
|
|
by Steuard
750 days ago
|
|
I guess I'm not entirely clear on the background/context here. I gather that the tweet's author isn't a serious Nethack player himself, but he is trying to train a neural net to play the game, and his training system somehow takes a model created by someone else as a baseline and tries to fine tune it somehow? But despite Nethack being based on randomly generated dungeons, the other model gets a consistent score every time, somehow? But even though the reference system reliably gets the same score through all the randomization of the dungeon, the game's full moon mechanic somehow throws it off significantly. I feel like I mostly understand most of the pieces of this story when taken individually, but I'm having trouble assembling most of them into a coherent whole. |
|
Which is where the I part of AI always falls down, input that sufficiently differs.
E.g., train facial recognition on a corpus of predominately white American faces, African Americans suffer a horribly high false positive rate when the cops use your model on surveillance footage.