by golol 1044 days ago
0.1% chance

My reasons are:

- I don't recall seeing any evidence that OpenAI has included new data in pretraining beyond the previous limit (Sept. 2021?) for GPT-3.5 or GPT-4

- Maybe they did fine-tuning or RLHF on new data, but that data is likely to be highly curated

- AI-generated content should be absolutely tiny in comparison to the data they are already working with.