Hacker News
by the_biot 1430 days ago
Like so many questions of this type, I think the Internet Archive is the answer. They are not a warehouse of stuff stored on media/formats that are forever becoming obsolete -- they store data, keyed by timestamp, sometimes with metadata. How they store it and serve it up is irrelevant, and they will upgrade as needed (I assume this is a continuous process).

If the IA didn't exist, we'd have to invent it.

I suspect if the IA ever decides to properly archive YouTube, they'll interface with the folks that run it directly. Archive Team is, to put it as diplomatically as I can, not a good organization.

I don't think that's a complete answer though. The article lists ContentID as the major reason videos get deleted, meaning that copyright trolls could go after IA if they want to. What we desperately need is copyright law reform.
I'm not 100% positive, but I recall a conversation we had on TheEye a while back with some IA reps, and they simply can't archive YouTube. I recall that a private project to archive just the video metadata ended up in the hundreds of terabytes. The videos themselves must be a gargantuan collection. Most YouTube archiving thus far has been done and maintained by private individuals.
The Internet Archive has a lot of stuff they don’t make available over the internet.

A couple of months ago, they sent me a thumb drive of some stuff I requested (for a nominal processing fee).

I'm surprised to hear that. What's the point of keeping stuff they don't make available? What sort of stuff is this?
Well, they do make it available, just not over the internet, I assume for legal reasons.

In my case it was some TV footage broadcast on the evening of September 10, 2001.

> If the IA didn't exist, we'd have to invent it.

You know, part of the original company vision for YouTube prior to the Google acquisition was really something akin to the IA, in that they prided themselves on hosting footage of the Indian Ocean Earthquake:

https://www.youtube.com/results?search_query=Indian+Ocean+ea...

Now, to put it as diplomatically as I possibly can: the direct interfacing between the IA and YouTube that you suggest was, at one point, a continuous process. But a number of factors have brought direct cooperation between the IA and Google (and thereby YouTube) to a screeching halt.

At this point in time, we stand at a historical crossroads, and I'm only here to act as a humble messenger ;)