by polyfractal
5047 days ago
I think this is the solution I'll go with, but I'm a bit unsure how it would work in practice. In particular, I don't know how I would evaluate the accuracy of automated clustering. Another option might be an automated-manual hybrid: manually identify common words/phrases for a particular category, then write a script that finds all items containing them and adds those items to the category.

But yeah, something like that would be a good start. I don't know anything about this domain, so I'm of limited help here. Categorizing based solely on words might be too hard if the words aren't distinct enough between categories.
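
For what it's worth, here is a rough sketch of the automated-manual hybrid described above, plus one simple way to check accuracy against a small hand-labeled sample. Everything in it is an assumption for illustration: the category names, the phrases in `KEYWORDS`, and the helpers `categorize` and `accuracy` are all made up, and plain substring counting is only one of many possible matching rules.

```python
from collections import Counter

# Hypothetical hand-picked phrases per category (illustrative only).
KEYWORDS = {
    "billing": ["invoice", "refund", "charge"],
    "login": ["password", "sign in", "locked out"],
}


def categorize(text, keywords=KEYWORDS):
    """Return the category whose phrases occur most often in text, or None."""
    lowered = text.lower()
    hits = Counter()
    for category, phrases in keywords.items():
        # Count case-insensitive substring occurrences of each phrase.
        hits[category] = sum(lowered.count(phrase) for phrase in phrases)
    if not hits:
        return None
    best, count = hits.most_common(1)[0]
    # No phrase matched at all: leave the item uncategorized.
    return best if count > 0 else None


def accuracy(labeled, predict=categorize):
    """Fraction of hand-labeled (text, category) pairs predicted correctly."""
    if not labeled:
        raise ValueError("need at least one labeled example")
    correct = sum(1 for text, label in labeled if predict(text) == label)
    return correct / len(labeled)
```

The same `accuracy` check would work for automated clustering too, as long as each cluster is mapped to a category name first: label a random sample by hand, run the predictor over it, and compare. Items that come back as `None` are the ones the phrase lists don't cover, which is a decent hint for which phrases to add next.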