Hacker News
Tales - a block-tolerant web scraper that runs on top of AWS and Rackspace
(github.com)
3 points by calufa 4871 days ago | 1 comment
rachelbythebay
4870 days ago
>> Tales is designed to scrape the web continuously, even when the domain being scraped blocks the scraper server's IP; it gets around this problem by failing over to a new node (server).
And people wonder why the AWS machines get filtered.
calufa
4869 days ago
This is why jclouds will be supported soon.
rachelbythebay
4869 days ago
And people will wonder why the jclouds machines get filtered.
It's not the "where". It's the "what".
calufa
4869 days ago
I am just enabling. For me it's not the "what". It's the "who".