| HN Mirror

Automattic offers this as a service. I think it’s $35,000 — so not too much, but probably out of range for a lot of places.

This is what I said to 404 Media yesterday [1]:

> “It’s gross. We have decades of this stuff built up and it keeps changing hands and changing hands. And what recourse do we have as the writers?,” Warren said. “We also do such a bad job of web preservation, of preserving the history of the web as it was. So it’s kind of fucked to me that these approximations of the past are being recreated in these really blasphemous ways.”

Unfortunately, an archive wouldn’t have solved this particular issue. The problem here was threefold.

First, the original domain had been inaccessible (but still owned by AOL/Verizon/Yahoo/Apollo Global) for close to ten years. It used to redirect to Engadget.com, where most of the archives (sans images as most of them were mangled in CMS migrations, I guess) still exist. (It is important to note they didn’t redirect individual URIs, just to the main Engadget domain.) That means that what was sold was explicitly the domain and not the content. (Many of us original authors owned the content, it turns out. And AOL and its I guess follow-on companies had a non-exclusive perpetual license to it)

Second, the domain was sold (and explicitly not the content) to a shady web hosting adjacent operator who runs a lot of these sorts of spam/splog operations, who then decided it would be fine to take the content from archive.org, recreate slugs (which I’m pretty sure there is no legal recourse for in any jurisdiction, which is fine) and poorly rewrite the articles (this is probably copyright infringement, as many of the headlines and text are extremely close or identical. Just worse because they’ve been mangled by AI summarizers) and then attribute that content to the bylines (incorrect at that) of the old staffers

Third, this operation then decided to start publishing automated and ripped off content under those same bylines of previous staffers, potentially creating confusion and professional repercussions, not to mention making a mess of SEO for those authors, etc., etc..

If the owner of a domain or brand determines that there is value in that domain or brand, having archives (which we had here) isn’t enough to stave off absolute ghouls who want to use late 2010s splog techniques to try to game Google or something, attempting to steal the bylines and identities of the former contributors. (The economics of this whole thing make no sense. But whatever. Not my circus.)

I’ll quote myself again (also from my interview with 404 Media):

> “What’s worse than not having a good archive of my work is having one that is bastardized with my name but not my face and not my words on it,” Warren told 404 Media. “If they wanted to try to revive an old brand, fine, but leave the original writers out of it, and leave the old content out of it too because a lot of that old stuff has been rewritten and regurgitated and I don’t have any idea what it is, and it’s not what I actually wrote.”

[1]: https://www.404media.co/a-beloved-tech-blog-tuaw-is-now-publ...