Hacker News new | ask | show | jobs
by Chaebixi 3108 days ago
I think you're misunderstanding the context.

IIRC, this isn't a collection of all tweets, etc (though the Library of Congress is already doing that), it's a curated collection of tweets, many important enough that some journalist somewhere bothered to write an article about them.

1 comments

The Library of Congress Twitter firehose archive never got off the ground: https://www.theatlantic.com/technology/archive/2016/08/can-t...
And it's too bad, now that Twitter is actually affecting history.
Yeah, it's a shame! The Internet Archive does some twitter archiving -- I helped a little by finding a listing of government social media accounts -- but it's not enough.
Twitter's policies make it worse, since if you have firehose access you are required to delete tweets when the account deletes them.

This is a well known tactic of many bot-net operators, and has made investigating them much harder.

Even worse, Twitter went and deleted the Russian accounts (and tweets) which Facebook had identified for them...

That makes it bad for the Library of Congress, yes, but they're already not succeeding.

IA doesn't have firehose access.