|
|
|
|
|
by languagehacker
3923 days ago
|
|
TextBlob is just an easy-to-use wrapper for a number of more involved libraries, including NLTK and Pattern. As with most things like it, if you're looking to shift off extremely unsophisticated NLP work to a junior developer, this is a good thing. If you're an engineer focused in the NLP space, using this API would be like tying your hand behind your back. It introduces its own performance problems, and obscures a number of configurations that the APIs of the libraries it wraps expose. I also find its attitude towards object-orientation tends to obscure performance bottlenecks by hiding how much just-in-time computation occurs for a given string. Also, I hate to admit this, but the Java/Scala NLP stack is beating out most Python NLP libraries these days. NLTK _just_ got Stanford CoreNLP's best-in-class dependency parser. It's been available in Java for years. |
|
spaCy's native Cython dependency parser is both faster and more accurate than CoreNLP.
The NP chunks example from the post: