Hacker News new | ask | show | jobs
by bubblyworld 543 days ago
This is the point I think - there's no inherent meaning to the scaling factor(s) as far as overall structure is concerned (they're dimensionless, so the units thing isn't a problem), so the outcome of a clustering algorithm should not depend on it.
1 comments

What if you first did PCA on your data?
You tell me? I'm not sure I understand the question.
Basically scale and rotate your data such that the variation in all dimensions becomes equal, so to speak.
Ah I see. As I understand it a general linear map like that isn't what the linked paper means by "scale-invariance", so it wouldn't be considered a violation for a dataset and it's PCA to be given different clusters by your clustering algorithm. It's only the dataset and its scaled up or down counterparts (i.e. the metric is multiplied by a fixed non-zero constant) that are required to get the same clusters for scale-invariance to hold.

In fact the paper doesn't assume that your dataset is contained in a vector space at all. All you have to give a clustering algorithm (as they define it) is a set and a metric function on it.

(the paper if you don't have a link: https://www.cs.cornell.edu/home/kleinber/nips15.pdf)