Hacker News new | ask | show | jobs
by mschulkind 787 days ago
Aren't you just describing a bag-of-words model?

https://en.wikipedia.org/wiki/Bag-of-words_model

1 comments

Yes! And the follow up that cosine similarity (for BoW) is a super simple similarity metric based on counting up the number of words the two vectors have in common.