| HN Mirror

AlphaStar or AlphaGo were trained on thousands of games and played hundreds of thousands of games to learn what works and what doesn't. This model is told the rules of the game and knows how to use the 4 basic inputs. I don't think it has any notion of strategy, good or bad moves.

But maybe with a feedback loop it will improve.

Pretty cool example though to see its limitations.