Hacker News new | ask | show | jobs
by nl 3271 days ago
The largest, most interesting recent public datasets in image and NLP were released by Google.

For example, here are some of their recent NLP datasets: https://github.com/google-research-datasets

In images, OpenImages is theirs, and there are assorted ones derived from YouTube.

Stanford's SNLI is the most recent non-Google NLP dataset which is getting used a lot. Babi (from FB) too, if you count that as NLP