Hacker News new | ask | show | jobs
by ad404b8a372f2b9 1536 days ago
That's because it uses as only input the description of the company, which is 80% non-informative common words, and 20% buzzwords. It's basically a buzzword detector.

I reckon you could reach the same results with TF-IDF and kmeans.