Hacker News new | ask | show | jobs
by techaddict009 4515 days ago
Check this out : http://commoncrawl.org/

Its not exactly what you are looking for but might help you.