Hacker News new | ask | show | jobs
by Sunspark 1100 days ago
If true, this is interesting. As a knowledge repository, having an archive of it would be fascinating.

The size seems small to me. Only 80 gigabytes? What happened to all the video and images that Reddit hosts? Not everything is imgur or other sites.

That said, 4.5 million is too much that they are asking. It's only data not IP or trade secrets as far as I know. I think they certainly could get 500k for the deletion. I would hope that they would donate some of that VC pocket change to worthy causes.

2 comments

> What happened to all the video and images that Reddit hosts?

Even just the text of comments and submissions is >>80GB per month.

I wouldn't be surprised if the "leak" deliberately excluded this data in favor of more interesting nonpublic data, though.

> Even just the text of comments and submissions is >>80GB per month.

Surely that can't be true after compression?

Not 80 GB/mo, but they're still pretty huge after compression. Zstd compressed comments and submissions for Feb 2023 were ~34 GB.
Given the mentions of confidential data, secrets about censorship, and GitHub artifacts, I would assume that public posts/comments aren't part of it? (if it is actually true)