by
ma2rten
4581 days ago
It would be great if Common Crawl (or anyone else) would also release a document-term index for its data. If you had an index, you could do a lot more with this data.
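
To make "document-term index" concrete, here is a rough sketch of the idea on a toy scale: a map from each term to the documents that contain it, so a lookup doesn't have to scan every page. The document ids, sample texts, and the `build_index` / `search` helpers are all made up for illustration; this is not anything Common Crawl actually ships, and a real index over a crawl of that size would need sharding, compression, and proper tokenization.

```python
import re
from collections import defaultdict


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        # Naive tokenization (an assumption): lowercase alphanumeric runs.
        for term in re.findall(r"[a-z0-9]+", text.lower()):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every term in the query."""
    terms = re.findall(r"[a-z0-9]+", query.lower())
    if not terms:
        return set()
    # Copy the first posting set so intersections don't mutate the index.
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical stand-ins for crawled pages.
docs = {
    "page-1": "Common Crawl publishes web crawl data",
    "page-2": "An index makes crawl data searchable",
    "page-3": "Hacker News discussion",
}
index = build_index(docs)
print(search(index, "crawl data"))
```

With something like this, a query touches only the posting sets for its terms instead of the full text of every document, which is the kind of thing that becomes possible once an index exists.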