Hacker News new | ask | show | jobs
by walletdrainer 215 days ago
FWIW circumventing various paywalls is probably the bad thing archive.is is being investigated for, not the archiving bit.
2 comments

An AdGuard employee working their Reddit subreddit let slip that the legal order that forced them to block those domains (from their ad-blocking DNS) was a - claimed! - result of Archive.today having saved CP and refusing to delete it.

Methinks someone accidentally archived the Epstein files, and the FBI is desperately trying to scrub the unredacted backups before the archive URL becomes well-known. That alone would align somewhat with the CP claim,

Do they actually do anything to circumvent paywalls or do websites just whitelist their crawlers?
Websites don’t whitelist their crawlers, they maintain custom bypasses for a wide variety of websites.

If the websites were inclined to whitelist these crawlers, they’d also whitelist archive.org which is actually easy to whitelist. Archive.is is not