|
This article forgot the very worst use of robots.txt:
User-agent: ia_archiver
Disallow: /
Those two lines block everything hosted on the site from the Internet Archive (archive.org) Wayback Machine, so the public cannot look at any previous versions of the website's content. It wipes out a public view of the past. Yeah, I'm looking at you, Washington Post: http://www.washingtonpost.com/robots.txt Banning access to history like that is shameful. |
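For anyone who wants to check a site's rules themselves, here is a minimal sketch using Python's standard `urllib.robotparser`. The helper name `blocks_archiver` and the `example.com` URL are made up for illustration. It only shows how a standards-following parser reads the two lines, not how the Internet Archive's own crawler behaves.

```python
from urllib.robotparser import RobotFileParser


def blocks_archiver(robots_txt: str, url: str = "https://example.com/") -> bool:
    """Return True if the given robots.txt text disallows ia_archiver for url.

    Hypothetical helper: parses the text locally, no network access.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch("ia_archiver", url)


# The two lines quoted in the comment above.
RULES = """\
User-agent: ia_archiver
Disallow: /
"""

if __name__ == "__main__":
    print(blocks_archiver(RULES))
```

To test a live site, the same parser can be pointed at a URL with `set_url()` and `read()` instead of `parse()`.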