# enwiki:
# Folks get annoyed when VfD discussions end up the number 1 google hit for
# their name. See T6776
Disallow: /wiki/Wikipedia:Articles_for_deletion/
Robots.txt isn't for hiding/suppressing information.
Often times you can have whole URL structures that are redundant with other ones, mainly database-generated pages with all sorts of possible query parameters often disguised as paths. Robots.txt is extremely useful in ensuring crawlers can make life easier for themselves by limiting to the "real" content, as opposed to the redundant stuff. Crawling the 5,000 real pages, not the 500,000 additional URL's that return the same content.
Also for ignoring "interactive" pages like login pages that make zero sense to be crawled.
People "give a crap" about robots.txt because it's useful for that.
In light of the recent surge in scraped content trumping original content on Google search, this is so true... people who scrape your site do not care about your preferences in a txt file.
While robots.txt is not a good security measure, it's a good tool for preventing pollution of search results with pages like wp_login.php. More to steer users to the subset of pages that are useful to them, away from the pages which are only an implementation detail.
Also, some poorly rate-limited crawlers actually abide by robots.txt, so it's useful to prevent unnecessary load.
So does Wikipedia. https://en.wikipedia.org/robots.txt . That points out:
As does GitHub - https://github.com/robots.txtEven the Internet Archive, which doesn't honor directives in the robots.txt files, has one - http://archive.org/robots.txt .
Welcome to the internet.