Hacker News new | ask | show | jobs
by b112 1099 days ago
Wait, you're saying openai was trained on reddit posts?!

No wonder people are scared of AI.

3 comments

It is well known, yes. This was discussed a way back during the whole glitch tokens deal:

https://www.youtube.com/watch?v=WO2X3oZEJOA

A bunch of those, like " SolidGoldMagikarp" are Reddit users.

Reddit and StackOverflow notably. Most use https://commoncrawl.org/ or a derivative of it.
You can often google Copilot output and find the source StackOverflow post. Saves a step I guess.
Making the "weaponized autism" trope real