|
|
|
|
|
by JoshTriplett
4923 days ago
|
|
> The archive.org team does follow robots.txt and I believe they remove content retroactively meaning if you update your site with a robots.txt it will delete the old content (which I think sucks). Indeed, especially since most domain parking garbage sites seem to have robots.txt files for some crazy reason. |
|
Presumably to avoid being plagued (in terms of load and bandwidth costs) by the numerous crawling bots looking to update their caches of pages that no longer exist on those domains.