| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ashildr 653 days ago
	I was about to post a link to the same URL but archived using singleFile, which looks like the original at amazon. I didn‘t because I realized that I have absolutely no idea what additional information would be hidden in the file. In the worst case any component sent by Amazon and archived into the file may contain PII, even if I am “logged out“. I‘m not saying that singleFile is bad in any way, I‘m using it a lot on multiple devices, but I‘m not sure whether sharing archives is a good idea™.

1 comments

nikisweeting 653 days ago

100%, this is the challenge of archiving logged in content.

It becomes un-shareable unless we use fake burner accounts for capture, or have really good sanitizing methods.

link

ashildr 653 days ago

Even when I‘m logged out I expect at least information on my geographical location to seep into the archive via URLs addressing specific CDN endpoints or similar mechanisms.

link

nikisweeting 653 days ago

Yup, this is why the ArchiveBox browser extension sends URLs to a separate server for archiving with an isolated burner profile.

I should write a full article on the security implications at some point, there aren't many good top-down explanations of why this is a hard problem.

link

ashildr 653 days ago

I know it’s a lot of work but this would be great and it may give readers a deeper understanding into security in general.

link

ninalanyon 653 days ago

How does it save pages that are only available when you are logged in such as social networking pages?

link

nikisweeting 652 days ago

You set up a chrome profile for archiving that's logged into all the sites you want to save. I recommend using burner accounts dedicated to archiving, so you'd have to add them to any private pages/groups you want to archive.

It is possible to use your main account for archiving but there are security risks (you cant share the snapshots without leaking session headers).

link

pbronez 651 days ago

That’s a very cool solution- gives the user explicit control

link