| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mattvr 519 days ago
	You could also work around this by adding a scaling transformation that normalizes and centers (e.g. sklearn StandardScaler) in between the raw embeddings — based on some example data points from your data set. Might introduce some bias, but I’ve found this helpful in some cases with off the shelf embeddings.