| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kakadzhun 1189 days ago
	If Reinforcement Learning is anything to go by, then a naive implementation of learning from past models will overfit to the previous model and start performing worse than even earlier models. There was a paper by someone @ Microsoft who tried to train a boardgame playing AI like this. The "best" models started losing to beginner level players from some point onwards.