Hacker News new | ask | show | jobs
by abstractbeliefs 3343 days ago
For those that don't know, Archive Team and Internet Archive are two different groups (though with an overlapping membership).

Internet Archive are a non-profit org that are legally held to high standards, as they should be. They're a very stable place to have data archived. That comes with a few limitations, like not making information available if there's any (even accidental) indication that the upstream site want it kept private - see the comments about robots.txt in tfa.

Archive Team, on the other hand, are a fairly fun and radical group that are far more loosely organised, who will archive what they can when it's needed, and horde it. Fuck your robots.txt![1]

If you can get involved in either organisation, it's highly recommended. They both have interesting challenges and solve them with neat tools.

[1] http://www.archiveteam.org/index.php?title=Robots.txt