Hacker News new | ask | show | jobs
by BHSPitMonkey 823 days ago
This doesn't track. They spent a lot of time and energy coordinating with the authors of popular mobile clients, and could easily have extended some means of letting them continue to operate given that they were clearly not harvesting content for themselves. Meanwhile, content can still be harvested for LLM training without the API (by using the HTML site).

It seems like the real intent was to regain control over the surfaces users use to consume the site, especially on mobile.

1 comments

Scraping is a lot more dicey than using an official API. Why did Google enter that partnership? They have the data in their index. The only conceivable reason is that they prefer to pay Reddit to avoid the risk of litigating it and ending up with some unfavorable precedent.
There's more than just the data you see remember, the data you don't see is also valuable. The DM's, the deleted posts, advertisingID's that link people to their accounts, and to their alt accounts, etc.
Avoid litigation and possibly of getting some injunction. And on other hand money can go to fund Reddit litigating others. As now they have proven someone paid so they could stop others using data. Slowing them down in process. And the sum is peanuts for Google, they waste that amount regularly...
Scraping and an official API are equally "dicey" in that the method isn't relevant to the question of how the content itself is licensed to be used.