Y
Hacker News
new
|
ask
|
show
|
jobs
by
murkt
238 days ago
You will need dictionaries with millions of tokens, which will make models much larger. Also, any word that has too low frequency to appear in the dictionary is now completely unknown to your model.