Hacker News new | ask | show | jobs
by Symmetry 2402 days ago
Yeah, that seems to be part of the reason that AlphaZero, which has its System2-like alpha-beta search seems so much smarter than AlphaStar which plays like a brilliant somnambulist.

Maybe if we had some way of abstracting out the things a machine learning system implicitly learns so we could deal with them in a more classical AI-like way?

1 comments

Instead of an alpha-beta search, AlphaZero uses a general-purpose Monte Carlo tree search (MCTS) algorithm. Source: https://science.sciencemag.org/content/362/6419/1140.full

I agree with your comment about System 2 like reasoning not being common right now. I am not an expert in the field but the closest thing I have seen to learned planning is: https://arxiv.org/pdf/1911.08265.pdf