Hacker News new | ask | show | jobs
by bootsmann 1158 days ago
Overfit in training and then complain that it doesn't generalize well in production, a true classic.
2 comments

If this is what happened, what explains the fact that some of these models performed great for about a year before decaying?
This seems rather dismissive for something actually published in a real journal and not just on arxiv for once, right?
If you check the abstract, then this got published more for describing these drift patterns and showing ways to visualize and detect them rather than dropping the weird statistic the article makes it about.