Hacker News new | ask | show | jobs
by exitzer0 759 days ago
as fun as it would be for redditors to now go and burn down any/all content that would feed this bot, you have to believe that this announcement is more of an after-the-fact notification vs a we-are-about-to-do-this piece of information.

either way. there has never been a better time to abandon reddit so ... see you all on Lemmy and here.

1 comments

If it bothers you if OpenAI were to scrape your public posts for training data, why would you quit posting on Reddit yet remain on other public forums like HN/Lemmy?

Seems more like an empty gesture than anything principled.

It bothers some people that Reddit’s value was built by its user base, and then Reddit turned around and abused its relationship with them for profit (pricing out third party apps that were considerably more value than the user-hostile and poor quality first party apps, then attempting to wrest control from and silence mods and community members, pumping up impressions by forcing promoted posts or subreddits, etc.). But I think you already knew that.
I have seen people on Hacker News earnestly suggest 24/7 VR for the elderly as a way to stave off dementia. If OpenAI wants to touch a dataset this radioactive, I welcome them with open arms and a knowing smile.
Included in that mix will be the wise and nurturing influence of 16 year-old angry white male affluent suburb libertarian intellects.

Junior ML engineers can graft on PR-friendly "AI safety" filters all day, but someone knows what evil lurks in the heart of gigabytes of floating point numbers.

HN is known for being one of the most dark and twisted places on the internet.
For me personally it's not a problem that OpenAI can scrape my posts, it's a problem that only OpenAI can scrape my posts.

I don't want one company to have the monopoly to train an AI on everything that me and all my friends post on a particular platform. If we're going to decide that training AIs on scraped data is fine, then everyone should have equal access to the dataset. Otherwise it's just a massive data grab and a massive transfer of power to whoever wins this data race, enacted by some platform owners hoping to monetize their users even more