Hacker News new | ask | show | jobs
by boxed 1460 days ago
I have a disallow all robots.txt for a production system. Have had from the beginning.

Bing indexes it. This is my first major security incident and I have no idea how to fix this without making everything totally shitty for the users.

1 comments

Some services ignore global disallow, but will respect rules explicitly targeted at them.
I've put in a hard block for all crawlers on all pages. Works for my scenario I think. Hopefully they don't lie in their user agent. Then it's going to be really bad.