Hacker News new | ask | show | jobs
by markburns 4303 days ago
Whenever I see "Natural Language Parser" mentioned anywhere I get excited then a little disappointed because it implies something much more profound.

Not to belittle the tremendous effort, but most projects I have seen are "English Language Parser"s.

Are there any actual generic language parsing projects out there?

That don't try to overfit to English but actually attempt to do a job of whatever quality in whatever language?

Like I'm a native English speaker, I can understand English say 100%, Japanese 80-90%, I can understand a bit of a few European languages and I can identify a bunch of other languages.

It would be wonderful if there were software with this design in mind.

2 comments

> Are there any actual generic language parsing projects out there?

Chalmers University has impressive results on this - http://www.grammaticalframework.org/

That's not what's meant by Natural Language Parsers. Most are trained on datasets in English, but given the right training they can work in other languages. NLP means a bunch of different things from POS tagging to dependency parsing. What this project is doing is semantic parsing.