Hacker News new | ask | show | jobs
by cadwag 5074 days ago
Local client-side scraping of CG to bring about similar results to Padmapper - I like it.

Noting that CG has recently gone on the offensive, I wonder where they would draw the line. At the moment, everything is done locally and it doesn't look like the extension is communicating with any central source. What if there was a central server that aggregated the results of all the distributed scraping to cache results and a) display them more quickly to users, and b) reduce the number of hits to CG?

Would CG rebuke the extension b/c its communicating with a central server and sharing CG's data in a manner not controlled by CG?

1 comments

I had the idea that browser clients could scrape content as they browse then send that anonymously to a central system that could be used by all, but with the copyright issue now in play it seems that it would give CL due process to shut down such a service.
That's essentially what NotifyWire.com did and got a C&D.
Oh well, good to know the idea was put into practice at least.