Hacker News
by
collyw
3060 days ago
Why the need for machine learning for the second part? It seems like a complicated way to do what you could do with some simple database queries.
plantpark
3060 days ago
It's not just about simple word frequencies. Common words like "like", "need", "second", and "part" show up throughout the whole document set, so they aren't very meaningful in a specific sentence. Googling "tf-idf" will show you more details about this.
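To make the tf-idf idea concrete, here is a minimal sketch in plain Python. It is not the code from the project under discussion: the `tf_idf` function, the whitespace tokenizer, and the three sample sentences are all made up for illustration, and it uses the basic unsmoothed formula (term frequency times log of N over document frequency).

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {word: score} dict per document (hypothetical helper)."""
    # Naive tokenizer: lowercase and split on whitespace (an assumption).
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)

    # Document frequency: in how many documents each word appears.
    df = Counter(word for tokens in tokenized for word in set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        scores.append({
            # tf = share of this document's tokens; idf = log(N / df).
            word: (count / len(tokens)) * math.log(n_docs / df[word])
            for word, count in counts.items()
        })
    return scores


# Made-up sample documents, not data from the thread.
docs = [
    "why the need for machine learning for the second part",
    "the second part could use simple database queries",
    "the machine learning part uses tf-idf",
]
scores = tf_idf(docs)
```

With these samples, "the" and "part" occur in every document, so their idf is log(1) and they score zero everywhere, while a word like "database" that occurs in only one document gets a positive weight.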
collyw
3059 days ago
OK, I looked that up. Again, isn't this something that Elasticsearch would do without needing to set up a machine learning system?