Hacker News new | ask | show | jobs
by kirillseva 2853 days ago
and perfect vs imperfect information - the policy network has to forecast enemy positions and deduce enemy goals. At least in Go you know full game state