Hacker News new | ask | show | jobs
by grilledchickenw 782 days ago
The wayback machine will always remember: https://web.archive.org/web/20240503105207/https://www.soflo...
4 comments

It doesn’t. You can block their crawler with robots.txt and send DMCA takedown requests for archived pages as the domain owner which they will honor.

Edit: I was under the wrong impression that if you specifically call out ia_archiver in robots.txt they would honor it. It’s been completely ignored since 2017.

If I remember correctly they changed their policy quite some time ago and started to ignore the robots.txt, but not 100% sure about it
robots.txt is a suggestion not a block
Their crawler now ignores robots.txt.
It doesn't say your account though.

It could just let you delete an arbitrary account ;)

please delete "root" here are 20 euros
Indeed - it used to be 30 euros:

https://web.archive.org/web/20230324031319/https://www.soflo...

I guess things are improving?

Or not depending on how expensive their COVID book access lawsuit turns out