Hacker News new | ask | show | jobs
by divideby0 4171 days ago
Looks pretty awesome, esp the clean DSL for your page model, but it seems like most of the documentation might be missing. How sophisticated is the crawler portion? Does it support Nutch-style generators that crawl more frequently updated pages more frequently? Or is it more designed for focused, one-off crawls a la Scrapy?
1 comments

The crawler portion is about as sophisticated as `wget -R`.