|
|
|
|
|
by wodenokoto
3778 days ago
|
|
> Reducing the sparsity brought that down to about 3,100 unique words [from 30,600 unique words] What does that mean? Does he remove words that are only said once or twice? Can anyone point me to a text explaining the difference between Identifying Characteristic Words using Log Likelihood and using tfidf. ? |
|