Hacker News
by Lewisham 5560 days ago
How are you doing this? Citeseer caches the free versions that people upload to their personal sites, so it is already a clearinghouse for papers that skirts the edge of legality.

I am guessing you're crawling from behind some library account, but I'm not entirely sure how you'd avoid detection by the local library (assuming, perhaps wrongly, reasonable computer security competency).

2 comments

Maybe it will work like RECAP does for PACER (https://www.recapthelaw.org/ ). Basically, a bunch of libraries and schools that already have access will grab the articles and send them to the free archive. Of course, the legal implications there were less intimidating, since the documents (court records) in PACER are in the public domain and PACER only charges a processing fee.
Yes, that's essentially it...