Hacker News new | ask | show | jobs
by sharkjacobs 349 days ago
better point of reference might be pages-articles-multistream.xml.bz2 (current pages without edit/revision history, no talk pages, no user pages) which is 20GB

https://en.wikipedia.org/wiki/Wikipedia:Database_download#Wh...?

1 comments

this is a much more deserving and reliable candidate for any labels regarding the breadth of human knowledge.
it barely touches the surface
regarding depth, not breadth, certainly