Hacker News new | ask | show | jobs
by howeman 3735 days ago
AlphaGo does do a tree search as you say, but the "value" of the stopping node is half the neural network you mention, and another half performing Monte Carlo with medium strength 'players' (2d amateur). These 'players' are really a quick version of the priority network that figures out which moves to play in the first place.
1 comments

In fact, the paper mentions that the policy network (without MCTS) alone performs reasonably well and can beat most other Go bots.