Hacker News new | ask | show | jobs
by samuellb 3623 days ago
I've used a re-compressed version in the past, with reduced xml markup, and with the most rarely visited articles removed. It was around 1GB when I downloaded it, and it used the proprietary MDX/MDict format. You can find downloads here (from 2010):

http://sh0dan.blogspot.com/

Or you could just download the Wikipedia CD. It contains 5500 articles from 2007:

https://en.wikipedia.org/wiki/Wikipedia:Wikipedia_CD_Selecti...

Neither of those two are very up to date of course, but it shows that it can be done.