Hacker News new | ask | show | jobs
How frequently is Commoncrawl updated, and what is its coverage level
6 points by donboscow 1126 days ago
How often is Commoncrawl updated? On a daily cadence? Or weekly/monthly? If Meghan Markle wears a Versace gown, that becomes a BBC article, and that article shows up on Googling "meghan markle" 2-3 minutes after the publishing of the article by BBC. What is the equivalent time for CC?

And secondly, is there a place where I can see CC coverage level? I mean - which websites they cover fully, which ones they cover partially, whether they cover reuters.com at all, or how much of of vice.com they cover, etc.?

1 comments

You can see the list of crawls including dates: https://commoncrawl.org/the-data/get-started/

Statistics here: https://commoncrawl.github.io/cc-crawl-statistics/