Hacker News new | ask | show | jobs
by stfu 5345 days ago
Kinda obvious that Facebook is getting scraped up and down for all sorts of reasons. Wasn't there some "art" project of some guy guy who recently scraped millions of (public) profiles?

"used programming interfaces from ihearthquotes.com" seems to be down/unknown the the googles?

2 comments

>Wasn't there some "art" project of some guy guy who recently scraped millions of (public) profiles?

Oh, it's amazing - http://openbook.org/

I think the grandparent post may have been thinking of this: http://www.sott.net/articles/show/223241 ('Dating' Site Imports 250,000 Facebook Profiles, Without Permission)

For the site linked in the parent post, it looks like the site uses the Facebook Graph API search function for public posts and then makes additional queries to show information about the creator of each post. Since you don't need a Facebook account to use it, I suspect they are making the queries via the server. They might be accumulating the profile photo data as they retrieve it, but it doesn't look any different than any other site that uses the server-side Facebook APIs.

It's a typo: iheartquotes.com is up.
Thanks! The site seems somewhat odd in relation to the story. Sure they have some API for pulling quotes, but using this for Facebook scraping seems a bit out of proportion.
The bots used the quotation site to generate dummy content, not for scaping.