Hacker News new | ask | show | jobs
by paxys 1981 days ago
Sites like this have a long tail problem. Yes doing a 100% dump would be best, but 90% of it is likely stuff that hasn't been accessed in years and never will again. So from a resourcing point of view it is better to save "most" than none at all.
2 comments

And yet, all is still better than most. The IA's entire point of existence is to preserve that long tail stuff that doesn't get preserved otherwise, and they certainly currently hold on to things that are far less useful than literally anything on tucows was (for eg. a website I worked on in 1995 that probably only had a few hundred visitors even then).

I hope this is just soft wording.

> I hope this is just soft wording.

Yeah, I hope that too. Maybe everything except 0.01% really problematic content has been transferred to the IA.

I guess I'm mostly concerned about an authoritative dump from those first years until 1996/1997 or so, probably less than 1 GB.
I wonder if someone still has a copy of the Tucows CD-ROMs from those days? (Since most folks were still on dial-up and everything.)