by thinkalone 2773 days ago
There is Corpora: https://github.com/dariusk/corpora/tree/master/data

> Corpora is a collection of small files. It is not meant to be an exhaustive source of anything: a list of resources should contain somewhere in the vicinity of 1000 items.
Thanks, will check it out.