Hacker News new | ask | show | jobs
by jcla1 4565 days ago
Not a full featured api, but a way to scrape all of HN: http://jcla1.com/blog/2013/05/13/crawling-hackernews/

Disclaimer: It's my own blog

edit: Uses HNSearch, so it doesn't violate the robots.txt and can be crawled faster

1 comments

Did you manage to download the whole database that way? Edit: Also, why didn't you use the "start" (offset) parameter?
No, not tried to download it yet. Regarding your question, if you try to use a start > 999 you get this error: "Validation error: max limit is 100, max start+limit is 1000", which is why I avoided that parameter.