Y
Hacker News
new
|
ask
|
show
|
jobs
by
benyami
4047 days ago
Check out the Internet Archive FAQ on how to remove a document from their archives.
https://archive.org/about/exclude.php
It looks like they used robots.txt to do that.
1 comments
neil_s
4047 days ago
Huh, so the wild-card user-agent will block not just searchbots, but also archivebots. Wonder how OP managed to get screenshots of archive.org having archives available for those documents.
link