Hacker News new | ask | show | jobs
by shafqat 6634 days ago
When you refer to classification, are you really talking about clustering? Is it the same thing? I've been looking at document/article clustering for NewsCred, and so far have spent some time with Carrot. Its open source and pretty good...
2 comments

clustering != classification

- classification (also known as categorization) - you have a set of categories into which you expect your documents to be assigned to (the documents will be matched with the existing categories)

- clustering - you expect the algorithm to give you a set of categories (the ML algorithms will find similarities between the several documents and will group them accordingly)

Hi there. I've not heard of Carrot. Do you have a link for it? My Google search brought up some orange things you grow and eat :)