


	by Matumio 1619 days ago
	I think they wanted to express that learning to predict the correct output ("error minimization") puts a limit on the achievable performance: at best, the model reproduces the answers it was trained on. Ranking (not just RL, really) only needs to tell which of two answers is better, so it allows improving beyond the current best-known answer.
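
	A toy sketch of that distinction, not anything from the thread being discussed. Everything in it is made up for illustration: the `quality` function, the candidate set, and the value of `best_known` are all assumptions. Error minimization can at best copy the best-known answer, while a comparison-based search can land on something better.

```python
def quality(x):
    # Hypothetical ground-truth quality, peaked at x = 7 (assumed for illustration).
    return -(x - 7) ** 2

candidates = list(range(11))
best_known = 5  # the "correct output" in the training data; not actually optimal

# Error minimization: the ideal outcome is an exact copy of the label.
imitated = min(candidates, key=lambda x: abs(x - best_known))

# Ranking: only needs to say which of two candidates is better.
def prefer(a, b):
    return a if quality(a) >= quality(b) else b

ranked_best = candidates[0]
for c in candidates[1:]:
    ranked_best = prefer(ranked_best, c)

print(imitated, ranked_best)
```

	In this setup the imitation result can never score higher than `best_known`, whereas the ranking loop is free to pick any candidate that wins its comparisons.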