Hacker News new | ask | show | jobs
by __e 895 days ago
While LLMs are designed to generalize from their training data, rather than simply memorizing it, overfitting occurs in niche areas. There's be plenty of niche areas and LLMs of today merely repeat training data, like with the NYT case. Larger datasets and better algorithms will help to an extent, but you'll always have niche topics where overfitting happens. The intent has always been to generalize well, however, it may not be feasible to do so in the long tail of the internet. How should copyright law address this?