Hacker News new | ask | show | jobs
by corygarms 134 days ago
These folks must really have their hands full with the 3M+ pages that were recently released. Hoping for an update once they expand this work to those new files.
1 comments

why do we count this in "pages" when it's mostly an email dump
Based on my random poking around through the latest datasets for a few hours, while there are a bunch of emails, I don't know if it's "mostly" emails.

That said, in my opinion they are using "pages" as the metric because it makes the number sound huge.