Hacker News new | ask | show | jobs
by mafuy 544 days ago
In my understanding, in KataGo, the decision of how long to follow a line is made solely by MCTS via its exploration/exploitation components. These in turn are influence by the policy/value outputs of the DCNN. So in practical terms, your statement might just be called true.

The raw net output includes some values that could be used in addition, but they are not used. I don't know if they were ever looked at closely for this purpose.