|
|
|
|
|
by toomuchtodo
410 days ago
|
|
Lots of good replies to your comment already. I'd also offer up Cloudflare offering the option to crawl customer origins, with them shipping the compressed archives off to Common Crawl for storage. This gives site admins and owners control over the crawling, and reduces unnecessary load as someone like Cloudflare can manage the crawler worker queue and network shipping internally. (Cloudflare customer, no other affiliation) |
|