Hacker News new | ask | show | jobs
by atrudeau 3755 days ago
Wow, this really took me by surprise. I thought the only input was (s_1...s_final, whowon) where s are statates during training and (s_current) during play, and the system would learn the game on its own. That's the way it worked with the Atari games anyway.
1 comments

I expect the Atari games, if we're thinking of the same articles, had much less strategic depth than playing a Go champion.