Hacker News new | ask | show | jobs
by mkl95 1151 days ago
It looks like the wrong AI for the problem. RL should be more successful.
2 comments

AlphaStar or AlphaGo were trained on thousands of games and played hundreds of thousands of games to learn what works and what doesn't. This model is told the rules of the game and knows how to use the 4 basic inputs. I don't think it has any notion of strategy, good or bad moves.

But maybe with a feedback loop it will improve.

Pretty cool example though to see its limitations.

Came here to read this, it'd actually be a very good project (DIY 2048 AI) to take up to get your hands dirty with Reinforcement Learning.