Hacker News new | ask | show | jobs
by Matumio 1619 days ago
I think they wanted to express that learning to predict the correct output ("error minimization") puts a limit on the achievable performance. While ranking (not just RL, really) allows to improve beyond the current best-known answer.