Y
Hacker News
new
|
ask
|
show
|
jobs
by
janci
2823 days ago
Parsing the html or traversing DOM is the easy part. Doing request queues, ip rotation, data quality management, exponential backoff etc. on scale is much harder.
1 comments
ziflex
2823 days ago
PRs are welcome :) There is gonna be a separate project within the organization that would do all these things and even more. It's just beginning :)
link