Hacker News new | ask | show | jobs
by nshm 1591 days ago
In general, language detection is surprisingly hard. There is LSTM-based implementation https://github.com/AU-DIS/LSTM_langid which should be better than ngrams.