Hacker News new | ask | show | jobs
by davidhowlett 2398 days ago
Instead of an alpha-beta search, AlphaZero uses a general-purpose Monte Carlo tree search (MCTS) algorithm. Source: https://science.sciencemag.org/content/362/6419/1140.full

I agree with your comment about System 2 like reasoning not being common right now. I am not an expert in the field but the closest thing I have seen to learned planning is: https://arxiv.org/pdf/1911.08265.pdf