Hacker News new | ask | show | jobs
by nathell 3917 days ago
TF-IDF solves an important problem and it's good to know about.

However, in some applications, such as Latent Semantic Analysis (LSA) and its generalizations, there are practical alternatives such as log-entropy [1] that I've found to work better in practice.

[1]: http://link.springer.com/article/10.3758%2FBF03203370#page-1