Hacker News new | ask | show | jobs
Why CommonCrawl is a Disruptive Force in Big Data (myeverwrite.com)
6 points by edmarferreira 5334 days ago
1 comments

Wasn't the major problem with commoncrawl that most of the index data was too old to be useful?

Besides, search index crawling is comparitively cheap compared to the data processing required to make it useful for actual search.