Hacker News new | ask | show | jobs
by ssaunier_ 4634 days ago
Don't need to scrape it, just use their API:

https://api.github.com/users?since=0

And iterate on. They even give you a HTTP header for the next URI:

Link: <https://api.github.com/users?since=135>; rel="next"

There is a rate limit of 60 (per hour), but if you use oauth then it's 5000. See http://developer.github.com/v3/#rate-limiting