We tell archive.org about the URI, they crawl it. They handle robots.txt.
In my experience, the site owner must email archive.org support to be excluded from its crawler and archiving.
[1]: https://boingboing.net/2017/04/22/internet-archive-to-ignore...
We tell archive.org about the URI, they crawl it. They handle robots.txt.