| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by divideby0 4218 days ago
	Looks pretty awesome, esp the clean DSL for your page model, but it seems like most of the documentation might be missing. How sophisticated is the crawler portion? Does it support Nutch-style generators that crawl more frequently updated pages more frequently? Or is it more designed for focused, one-off crawls a la Scrapy?

1 comments

The crawler portion is about as sophisticated as `wget -R`.