| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by gcp 3739 days ago

MCTS does exceptionally well at Go all by itself. When coupled with something (in this case trained neural nets) that can give it better-than-random guidance in exploring the state space, it's no surprise it does even better.

The tricky part is that using the "best (rather than better) than random guidance" in the Monte Carlo simulations makes the performance worse.

We don't understand very deeply why that is.