by shmatt 1099 days ago
My bet is that investors like Andreessen Horowitz and Sequoia Capital are pulling their hair out, seeing Reddit lose money at the end of the month, after OpenAI and others are raising trillions by scraping their data.

Not reddit data, users data.
This is the key. Reddit owns nothing except a very poorly performant app that no one wants and a very forced and shitty web experience unless you are on old.reddit.com.

It's mind-boggling how wrong they've been getting it for as long as they've been getting it wrong. Like, how can any company/CEO be that dense?

Doesn't Reddit own the data now? That's the unfortunate truth of these platforms. They offer you a platform and pay for hosting, and you give them your attention and content.
Legally, they don't own the data, but they do have a perpetual license to the data and can do basically whatever they want with it. Not much of a difference, but it's one of those things where the details might be crucial.
I haven't tested it, but if as a user you were to ask for your data to be deleted, they'd have to comply, at least in the EU. So if there was a mass walkout, a lot of comments and posts would disappear.
No need to ask, just use Power Delete Suite: https://github.com/j0be/PowerDeleteSuite
Yes, this is true. However, the fact that reddit was where they got it means OpenAI capitalized on something that was given for free, and now reddit is trying to do the same and failing (because they have no value proposition).

It's sad that the place where all the users stored their data is getting burned down because it offered free access.

Exactly, when users start migrating, Reddit is dead in the water.
Not your keys, not your coins
There's no way to win this battle, though. Are they going to ... stop the Internet? People will find a way to siphon data from sites where people contribute large volumes of data.
Wait, you're saying OpenAI's models were trained on Reddit posts?!

No wonder people are scared of AI.

It is well known, yes. This was discussed a while back during the whole glitch tokens deal:

https://www.youtube.com/watch?v=WO2X3oZEJOA

A bunch of those, like " SolidGoldMagikarp", are Reddit usernames.

Reddit and StackOverflow notably. Most use https://commoncrawl.org/ or a derivative of it.
You can often google Copilot output and find the source StackOverflow post. Saves a step I guess.
Making the "weaponized autism" trope real