Hacker News new | ask | show | jobs
by kakadzhun 1189 days ago
If Reinforcement Learning is anything to go by, then a naive implementation of learning from past models will overfit to the previous model and start performing worse than even earlier models.

There was a paper by someone @ Microsoft who tried to train a boardgame playing AI like this. The "best" models started losing to beginner level players from some point onwards.