Hacker News new | ask | show | jobs
by collyw 3060 days ago
Why the need for machine learning for the second part? It seems like a complicated way to do what you could do with some simple database queries.
1 comments

It's not about some simple word frequencies of words. Some common words like "like" "need" "second" "part" in the dataset of whole documents isn't so meaningful in a specific sentence. Google "tf-idf" will show you more details about this.
Ok, I looked that up. Again isn't this something that Elasticsearch would do without needing to set up a machine learning system?