| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by wongarsu 812 days ago
	Which at its core is probably an alignment problem in the way the models are evaluated: they are measured on their short-term effects, and there exploitation rules. But if you look at the long-term effect of recommendations you really need a healthy dose of exploration to keep your users around.