by jefftk
2501 days ago
robots.txt is a tool to control crawling, not to specify how you would like your site to be displayed (or not) in search results. If you don't want search engines to include your site, set: <meta name="robots" content="noindex">
while to block just Google do: <meta name="googlebot" content="noindex">
See https://support.google.com/webmasters/answer/93710

If Googlebot is not respecting robots.txt, and is crawling something it's been instructed not to crawl, let me know and I can file a bug? (Disclosure: I work for Google but not on Search, speaking only for myself)
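
To make the crawling vs. indexing distinction concrete, here is a rough sketch of how a site owner might check the two signals separately using only the Python standard library. The helper names (`may_crawl`, `asks_noindex`, `RobotsMetaParser`) and the example.com URLs are made up for illustration, and this only approximates how a well-behaved crawler might read these signals; it is not a description of how Googlebot itself is implemented.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


class RobotsMetaParser(HTMLParser):
    """Collects directives from <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = {}  # meta name -> set of lowercase directives

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key.lower(): (value or "") for key, value in attrs}
        name = attr.get("name", "").lower()
        if name in ("robots", "googlebot"):
            tokens = {t.strip().lower() for t in attr.get("content", "").split(",")}
            tokens.discard("")
            self.directives.setdefault(name, set()).update(tokens)


def may_crawl(robots_txt, user_agent, url):
    """Crawling question: does this robots.txt text let user_agent fetch url?"""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


def asks_noindex(html, crawler="googlebot"):
    """Indexing question: does the page ask to be left out of search results?

    Checks the generic "robots" meta tag and the crawler-specific one.
    """
    parser = RobotsMetaParser()
    parser.feed(html)
    generic = parser.directives.get("robots", set())
    specific = parser.directives.get(crawler.lower(), set())
    return "noindex" in generic or "noindex" in specific


if __name__ == "__main__":
    robots_txt = "User-agent: *\nDisallow: /private/\n"
    page = '<html><head><meta name="googlebot" content="noindex"></head></html>'

    print(may_crawl(robots_txt, "Googlebot", "https://example.com/private/page"))
    print(may_crawl(robots_txt, "Googlebot", "https://example.com/public"))
    print(asks_noindex(page, crawler="googlebot"))
```

The two checks are deliberately independent: a robots.txt rule says nothing about what appears in search results, and a noindex tag says nothing about whether the page may be fetched.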
How do you tell Googlebot to not crawl your site and to not index it either?
Previously, one could use the undocumented "Noindex" directive in robots.txt, but this will be disabled soon: https://webmasters.googleblog.com/2019/07/a-note-on-unsuppor...