| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by BHSPitMonkey 823 days ago
	This doesn't track. They spent a lot of time and energy coordinating with the authors of popular mobile clients, and could easily have extended some means of letting them continue to operate given that they were clearly not harvesting content for themselves. Meanwhile, content can still be harvested for LLM training without the API (by using the HTML site). It seems like the real intent was to regain control over the surfaces users use to consume the site, especially on mobile.

1 comments

quatrefoil 823 days ago

Scraping is a lot more dicey than using an official API. Why did Google enter that partnership? They have the data in their index. The only conceivable reason is that they prefer to pay Reddit to avoid the risk of litigating it and ending up with some unfavorable precedent.

link

jtriangle 823 days ago

There's more than just the data you see remember, the data you don't see is also valuable. The DM's, the deleted posts, advertisingID's that link people to their accounts, and to their alt accounts, etc.

link

Ekaros 823 days ago

Avoid litigation and possibly of getting some injunction. And on other hand money can go to fund Reddit litigating others. As now they have proven someone paid so they could stop others using data. Slowing them down in process. And the sum is peanuts for Google, they waste that amount regularly...

link

BHSPitMonkey 822 days ago

Scraping and an official API are equally "dicey" in that the method isn't relevant to the question of how the content itself is licensed to be used.

link