Hacker News new | ask | show | jobs
by murkt 238 days ago
You will need dictionaries with millions of tokens, which will make models much larger. Also, any word that has too low frequency to appear in the dictionary is now completely unknown to your model.