Y
Hacker News
new
|
ask
|
show
|
jobs
by
hjalle
2540 days ago
Funny experiment and perhaps also useful, but there are crawlers with good intentions[1] that still may ignore the disallows. I don't know of anyone else than the internet archive though.
[1]
https://blog.archive.org/2017/04/17/robots-txt-meant-for-sea...
1 comments
zaarn
2539 days ago
Those crawlers can almost always be recognized by the UA.
link
hjalle
2539 days ago
Yes, no doubt.
link