by jjviana 743 days ago
Looks great! The next step would be to do what AlphaGo / AlphaStar did and use the MCTS data to train a neural network to act as the value function.
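
For anyone wondering what that could look like, here is a rough sketch, not the project's actual code. It assumes the search has already been logged as pairs of (state features, MCTS value estimate in [-1, 1]); the synthetic `states` and `targets` arrays, the `train_value_net` helper, and the tiny NumPy MLP are all made up for illustration. A real setup would use actual game positions and probably a proper deep learning framework.

```python
import numpy as np


def train_value_net(states, targets, hidden=16, lr=0.5, epochs=1000, seed=0):
    """Fit a one-hidden-layer MLP so that net(state) approximates the MCTS value."""
    rng = np.random.default_rng(seed)
    n, d = states.shape
    W1 = rng.normal(0.0, 1.0 / np.sqrt(d), (d, hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0.0, 1.0 / np.sqrt(hidden), (hidden, 1))
    b2 = np.zeros(1)
    y = targets.reshape(-1, 1)

    for _ in range(epochs):
        # Forward pass: tanh hidden layer, tanh output so values stay in [-1, 1].
        h = np.tanh(states @ W1 + b1)
        v = np.tanh(h @ W2 + b2)

        # Backward pass for the mean squared error between v and the MCTS targets.
        dv = 2.0 * (v - y) / n * (1.0 - v ** 2)
        grad_W2 = h.T @ dv
        grad_b2 = dv.sum(axis=0)
        dh = (dv @ W2.T) * (1.0 - h ** 2)
        grad_W1 = states.T @ dh
        grad_b1 = dh.sum(axis=0)

        # Plain full-batch gradient descent step.
        W1 -= lr * grad_W1
        b1 -= lr * grad_b1
        W2 -= lr * grad_W2
        b2 -= lr * grad_b2

    def value_fn(s):
        """Return the learned value estimate for one state or a batch of states."""
        s = np.atleast_2d(s)
        hidden_act = np.tanh(s @ W1 + b1)
        return np.tanh(hidden_act @ W2 + b2).ravel()

    return value_fn


# Synthetic stand-in for logged search data (an assumption, not real MCTS output):
# each row is a state feature vector, each target plays the role of the root value.
data_rng = np.random.default_rng(42)
states = data_rng.normal(size=(200, 4))
targets = np.tanh(states @ np.array([1.0, -1.0, 0.5, 0.0]))

value_fn = train_value_net(states, targets)
predictions = value_fn(states)

baseline_mse = float(np.mean(targets ** 2))            # error of always predicting 0
trained_mse = float(np.mean((predictions - targets) ** 2))
print(f"baseline MSE: {baseline_mse:.4f}, trained MSE: {trained_mse:.4f}")
```

Once trained, `value_fn` could replace random rollouts at the leaf nodes, and the improved search could then generate better training targets for the next round.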