Hacker News new | ask | show | jobs
by hjalle 2540 days ago
Funny experiment and perhaps also useful, but there are crawlers with good intentions[1] that still may ignore the disallows. I don't know of anyone else than the internet archive though.

[1] https://blog.archive.org/2017/04/17/robots-txt-meant-for-sea...

1 comments

Those crawlers can almost always be recognized by the UA.
Yes, no doubt.