| > And so perhaps the ethics of web scraping are not so straightforward. It strikes me that the _ethics_ of web scraping are extremely straightforward and cognizable with a terse analysis: * You can respond however you like to my HTTP request, and I can parse your response however I like. Simple, traditional, common. This is the way that conversations have occurred since the dawn of human communication, no? > the legal issues associated with it. But aren't these, without exception, fabrics spun out of the cloth that shields established players with the threat of state violence? This is not particularly new, and seems to fit in the pathetic-and-predictable file. Moreover, the broader cheap attempt to cast this in "intellectual" property terms, and to attach that to protection of artists and creators, warrants a very particular eye-roll for its illogic. |
Because if that's your general principles, you are making the internet much shittier. I still remember the old internet with open SMTP servers, easy-to-use comment forms, and forums which did not require emails and capthas. But people with "You can respond however you like to my HTTP request" attitude ruined it with spam, scam and SEO.
If you only apply this to web scraping, then where do you draw the line and why? Can you scrape at maximum rate server can support? Can you scrape if this requires active action (like account creation?) As long as you scrape, can you also post some links to improve your SEO?