by peteratt
3246 days ago
Are there any libraries out there that implement some/most of these best practices and approaches to NLP? From what I've seen, the existing ones (Stanford NLP, OpenNLP) are getting somewhat dated. Many non-PhD people (including me) would find such a library incredibly useful.
I can recommend https://spacy.io as a low-fuss solution that gets you up and running quickly.
Oversimplifying quite a bit: if spaCy focuses on syntax, then gensim focuses on semantics (https://en.wikipedia.org/wiki/Distributional_semantics). Gensim has an active community and is well documented.
https://radimrehurek.com/gensim/
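To make "distributional semantics" a bit more concrete, here's a toy sketch in plain Python. To be clear, this is not gensim's API or how gensim works internally; it only illustrates the underlying idea that words appearing in similar contexts end up with similar vectors. The corpus, the window size, and the function names (cooccurrence_vectors, cosine) are all made up for illustration.

```python
from collections import Counter, defaultdict
from math import sqrt


def cooccurrence_vectors(sentences, window=2):
    """Map each word to a Counter of the words seen within `window` positions of it."""
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical toy corpus; a real one would be many orders of magnitude larger.
corpus = [
    "the cat drinks milk",
    "the dog drinks water",
    "the cat chases the mouse",
    "the dog chases the ball",
    "stocks fell sharply on monday",
]

vectors = cooccurrence_vectors(corpus)
sim_cat_dog = cosine(vectors["cat"], vectors["dog"])
sim_cat_stocks = cosine(vectors["cat"], vectors["stocks"])
print(sim_cat_dog, sim_cat_stocks)
```

"cat" and "dog" share most of their neighbours here, so they score far closer to each other than either does to "stocks". Real libraries replace the raw counts with learned dense vectors, but the intuition is the same.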
If you have the data, can spend a few days experimenting, and want something that can train orders of magnitude faster than deep learning, there's Vowpal Wabbit. Prediction speed is blazing. Results can be nearly state of the art at a great deal less cost. It's written in C++ but has bindings for many languages. It's very poorly documented.
https://github.com/JohnLangford/vowpal_wabbit/wiki/Learning%...
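For a rough feel of why this style of learner is so cheap to train, here's a minimal sketch of the general technique Vowpal Wabbit is known for: a linear model over hashed features, updated one example at a time. This is not Vowpal Wabbit's code or API. The bucket count, learning rate, example data, and function names below are all assumptions picked for illustration.

```python
import math
import zlib

NUM_BITS = 18                 # assumed size of the hashed feature space (2**18 buckets)
NUM_WEIGHTS = 1 << NUM_BITS


def hashed_features(text):
    """Hash each token straight to a weight index; no vocabulary is ever built."""
    return [zlib.crc32(tok.encode("utf-8")) % NUM_WEIGHTS
            for tok in text.lower().split()]


def predict(weights, indices):
    """Logistic prediction from the sum of the active weights."""
    score = sum(weights[i] for i in indices)
    return 1.0 / (1.0 + math.exp(-score))


def update(weights, indices, label, lr=0.5):
    """One online gradient step on the logistic loss for a single example."""
    error = predict(weights, indices) - label
    for i in indices:
        weights[i] -= lr * error


# Hypothetical toy data: 1 = positive review, 0 = negative review.
examples = [
    ("great fun movie", 1),
    ("boring awful movie", 0),
    ("great acting", 1),
    ("awful plot", 0),
]

weights = [0.0] * NUM_WEIGHTS
for _ in range(20):           # a few passes over the tiny dataset
    for text, label in examples:
        update(weights, hashed_features(text), label)

p_pos = predict(weights, hashed_features("great fun"))
p_neg = predict(weights, hashed_features("awful boring"))
print(p_pos, p_neg)
```

Each update only touches the handful of weights that are active in the current example, which is where the speed comes from. The price is that the model is linear, so any feature interactions have to be engineered by hand.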
I've never taken a gander at Facebook's fastText.