Hacker News new | ask | show | jobs
by pocketsand 690 days ago
You would think so, but people like Sam Altman have suggested that they can use AI-generated data to train their own models. See here:

https://www.nytimes.com/2024/04/06/technology/tech-giants-ha...

2 comments

At no point should you trust anything Sam Altman says.
Training on ai-generated data isn't a problem, and has been routinely done by everyone for 18 mo +.

The issue is training on 'indiscriminate' ai-generated data. This just leads to more and more degenerate results. No one is doing this however, there is always some kind of filtering to select which generated data to use for training. So the finding of that paper are entirely not surprising, and frankly, intuitive and already well known.