| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by walletdrainer 262 days ago
	FWIW circumventing various paywalls is probably the bad thing archive.is is being investigated for, not the archiving bit.

2 comments

rekabis 262 days ago

An AdGuard employee working their Reddit subreddit let slip that the legal order that forced them to block those domains (from their ad-blocking DNS) was a - claimed! - result of Archive.today having saved CP and refusing to delete it.

Methinks someone accidentally archived the Epstein files, and the FBI is desperately trying to scrub the unredacted backups before the archive URL becomes well-known. That alone would align somewhat with the CP claim,

link

dogma1138 262 days ago

Do they actually do anything to circumvent paywalls or do websites just whitelist their crawlers?

link

walletdrainer 262 days ago

Websites don’t whitelist their crawlers, they maintain custom bypasses for a wide variety of websites.

If the websites were inclined to whitelist these crawlers, they’d also whitelist archive.org which is actually easy to whitelist. Archive.is is not

link