|
> Because they are so JS-heavy, and reliant on CI/CD pipelines for deployment, on custom CMSes, there is no way to archive them in the way that static pages containing just text and images can be archived on the Wayback Machine. Welcome to the world of digital archiving. It's an enormously complicated space, and even for just my own personal projects and content, I've spent a lot of time thinking about how to ensure things are future proof and can be archived easily. As a simple example, building my personal website atop Markdown ensures that, even if the formatting can't be preserved, the core content will be since it's simple ASCII (yes, that's ignoring issues of long-term digital storage and access and so forth, but at least it's not also a bunch of binary blobs or database formats or whatnot). Equally alarming is that fact that so much of our digital lives aren't even in our control. A historian used to be able to rely on family archives, public libraries, etc, to understand our past. A hundred years from now they'll be looking back and hoping someone somewhere preserved the contents of an S3 bucket before Amazon decided to delete it on a whim... |
One reason I take these is that I have RAM and I have grep and apparently either the people who had the data don't have RAM or they don't have grep, and so while I can ask my local Facebook archive "Er, didn't I write something about anti-freeze?" and get an answer in seconds, Facebook itself will try to suggest I might want pages about anti-freeze, a group that cares about anti-freeze, a sponsored advert for anti-freeze ... and not the thing I wrote.