Hacker News new | ask | show | jobs
by wraptile 1032 days ago
You can do the opposite since the inception of robots.txt: User-agent: * Disallow: / and then whitelist google bot and whatnot. Most of the web is already configured this way. Just check robots.txt of any major website, e.g. https://twitter.com/robots.txt
1 comments

The Allow: directive was an extension to robots.txt added later.