| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by CuriouslyC 1563 days ago
	The model underlying k-means is that all the data is distributed into k hyperspheres. In the simple 2D case, that means drawing k circles around your data points in a X/Y plot such that the inter-group variance is minimized. This is bad because in the real world, data is typically grouped in an elliptical or irregular way. There are some examples of this at https://stats.stackexchange.com/questions/133656/how-to-unde...

1 comments

Thank you!