by deepnet
3256 days ago
Yes, absolutely, I think your explication is perfectly correct. Though (IMHO) MCTS is better characterised as evaluating moves rather than exploring plans. MCTS only explores moves in order of likelihood, using the most basic of heuristics: random playout. The net outputs likely moves based only on the current board position; it formulates no strategy. No state is stored across moves: each play is independent, relying only on the current board position. I still don't see anything anywhere in AlphaGo that is a plan, trajectory or strategy. Neither is there an evaluation of the opponent, nor any attempt to outwit them. That it performs so astonishingly well without a plan is very interesting and should perhaps give us pause: is planning a hubris? Do we undervalue our use of heuristics in our own behaviour?
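
To make "evaluating moves rather than exploring plans" concrete, here is a toy sketch. It is not AlphaGo's search: it is flat Monte Carlo (no tree, no networks) on a tiny Nim game that I am using as a stand-in for Go, and every name in it (`legal_moves`, `random_playout`, `evaluate_moves`, `choose_move`) is made up for illustration. The point is only that each move is scored by random playouts from the current position, and nothing is remembered between calls.

```python
import random

def legal_moves(stones):
    """Toy Nim (assumed stand-in game): take 1, 2 or 3 stones; taking the last stone wins."""
    return [k for k in (1, 2, 3) if k <= stones]

def random_playout(stones, rng):
    """Play uniformly random moves to the end of the game.

    Returns True if the player to move at `stones` ends up taking the last stone.
    """
    first_to_move = True
    while stones > 0:
        stones -= rng.choice(legal_moves(stones))
        if stones == 0:
            return first_to_move
        first_to_move = not first_to_move
    return False  # no stones left at the start: the player to move has already lost

def evaluate_moves(stones, n_playouts, rng):
    """Score each legal move by its win rate over random playouts.

    Depends only on the current position; no state survives between calls.
    """
    rates = {}
    for move in legal_moves(stones):
        # After our move the opponent is to move; we win whenever they lose the playout.
        wins = sum(not random_playout(stones - move, rng) for _ in range(n_playouts))
        rates[move] = wins / n_playouts
    return rates

def choose_move(stones, n_playouts, rng):
    """Pick the move with the best estimated win rate. No plan, just per-move scores."""
    rates = evaluate_moves(stones, n_playouts, rng)
    return max(rates, key=rates.get)

if __name__ == "__main__":
    rng = random.Random(0)
    print(evaluate_moves(3, 200, rng))
    print(choose_move(3, 200, rng))
```

Note that `choose_move` takes only the current pile size: call it again next turn and it starts from scratch, which is the "no state stored across moves" property in miniature.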