| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by failedengineer 1466 days ago
	I'd be completely flabbergasted if they trained an AI to parse source code while playing a game. I think it's the "lack of addition" rather than the "expressly disallowed" training rules that are being referred to.

3 comments

amalcon 1465 days ago

They wouldn't necessarily need to parse the source code. E.g. initially training a neural network on replays of human runs (AlphaGo style, as opposed to AlphaZero) would probably be considered "spoiling" in the Nethack community, but it's a plausible approach assuming you can somehow obtain a training dataset.

link

cool_dude85 1466 days ago

The AI doesn't need to parse the source code. I haven't played Nethack, but based on the example given of writing ELBERETH in the dirt, the person training the model could add that to the potential actions being considered at each step and the model learns what it does by playing a million games.

link

pmontra 1465 days ago

I used to read the source code before I managed to find a guide and I never won the game. That could be Hack, not Nethack, too many years passed by.

link