Hacker News new | ask | show | jobs
by gcp 3739 days ago
MCTS does exceptionally well at Go all by itself. When coupled with something (in this case trained neural nets) that can give it better-than-random guidance in exploring the state space, it's no surprise it does even better.

The tricky part is that using the "best (rather than better) than random guidance" in the Monte Carlo simulations makes the performance worse.

We don't understand very deeply why that is.