Hacker News new | ask | show | jobs
by Triesault 2945 days ago
Out of curiosity, what kind of "nonmalicious but unwelcomed" scraping?
4 comments

Not GP but scraping price data off some stores (particularly travel sites) tends to be unwelcomed by operators.
Myself, I did scrapping of a classified ads site once, to have notifications when a particular product I was looking for showed up for sale.
Not GP but scraping data for personal projects off companies overprotective of their data is probably the most common example.
Trying to get your entire reddit history, for example, is an obnoxiously difficult thing to do. It's impossible to get more than 1000 comments with the officially supported mechanism, leading people to do all kinds of strange workarounds to try to find older comments.

As a 10 year redditor, that really frustrates me, since I can't go back and see my early posts any more.

Curious if Reddit will have to provide an easy mechanism to retrieve all of your comments now as part of GDPR.
Check the announcement post. In a comment one of the admins said they will eventually but for now it's email only for residents of the EU.

https://www.reddit.com/r/announcements/comments/8m2yr4/were_...

If everyone in the EU could email them to speed this up that would be great.
> As a 10 year redditor, that really frustrates me, since I can't go back and see my early posts any more.

I'd much rather they let you get your own history out and then make it impossible to get said history for anyone else (unless that person chooses to allow it).

(GP here) re: the second part, I guess I consider most scraping 'unwelcome'

To the first part: I would characterize certain scraping, usually behind an authentication wall, as malicious--though admittedly that's not the right word. An example would be scraping Facebook profiles to build a marketing list.

So, by 'non malicious,' i mostly mean 'publicly available data'

I've scraped content from sites with (and without) logins, for offline perusal when I was on a cruise.