Hacker News
by krsdcbl 561 days ago
Maybe it's just me, or I might be missing a relevant implication, but I'm having a hard time understanding why so many people have become alarmist about the fact that things they publish on the web can and will be scraped.
4 comments

It seems to be mainly a reaction against AI (as opposed to scraping in general, e.g. for a search engine).

I'm not saying it makes sense, but there is a large and growing sentiment: I want my content out in the world, but I don't want companies to use it for training AIs, especially for profit.

Then why use a private company's platform, with its terms-of-use agreement?

It's like going to a butcher's and asking for vegan food.

Might just be one of these: https://en.wikipedia.org/wiki/Availability_cascade

I do find PG's idea of "aggressively conventional-minded" people to be a useful concept: https://paulgraham.com/conformism.html

It's mainly artists who got harmed economically. Some/many of them have been pushing the line that scraping and training models on scraped data is illegal or unethical, and they got neutral, left-leaning people to agree with them on it.

I think it has to do with the previous article where Bluesky promised not to train AI on your data. It's like they won't do it, but everyone else will.