| HN Mirror



	by thesehands 1876 days ago
	For all common models (GloVe, fastText, word2vec), the mean across the components of each word embedding is tightly concentrated around zero (relative to the number of dimensions), which makes the widely used cosine similarity practically equivalent to Pearson correlation, since Pearson correlation is just cosine similarity computed after subtracting each vector's mean: https://www.aclweb.org/anthology/N19-1100/
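
	A rough way to see this numerically, as a minimal sketch: the vectors below are synthetic Gaussian draws standing in for embeddings (an assumption, not real GloVe/fastText/word2vec vectors), and the helper names `cosine_similarity` and `pearson_correlation` are made up for illustration. When the component mean is near zero the two measures nearly coincide; adding a constant offset to every component pulls them apart.

```python
import numpy as np


def cosine_similarity(x, y):
    """Cosine of the angle between x and y."""
    return float(np.dot(x, y) / (np.linalg.norm(x) * np.linalg.norm(y)))


def pearson_correlation(x, y):
    """Pearson correlation written as cosine similarity of mean-centered vectors."""
    return cosine_similarity(x - x.mean(), y - y.mean())


# Synthetic stand-ins for two related 300-dimensional embeddings (assumption:
# components are roughly zero-mean, as the comment says holds for common models).
rng = np.random.default_rng(0)
dim = 300
x = rng.normal(0.0, 1.0, dim)
y = 0.5 * x + rng.normal(0.0, 1.0, dim)

# Near-zero component means: cosine and Pearson almost agree.
gap = abs(cosine_similarity(x, y) - pearson_correlation(x, y))

# Shift every component by a constant: Pearson is unchanged, cosine is not.
shift = 2.0
gap_shifted = abs(
    cosine_similarity(x + shift, y + shift) - pearson_correlation(x + shift, y + shift)
)

print(f"gap with near-zero means: {gap:.4f}")
print(f"gap after shifting by {shift}: {gap_shifted:.4f}")
```

	The centering step is the only difference between the two functions, so how far they diverge depends entirely on how far the component means are from zero.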