Hacker News
by gojomo 3954 days ago
The gensim library now offers the 'Paragraph Vector' [1] approach for learning vectors for longer spans of text, such as sentences, paragraphs, or whole documents. It's available in a class named Doc2Vec, but I don't think that's what is being used here.

The Paragraph Vector approach can give interesting results for document similarity, including similarity after 'algebraic' additions and subtractions of the vectors for other topics or word-concepts. [2]
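
To make the 'algebraic' part concrete, here is a rough sketch of the idea in plain Python. Everything in it is an assumption for illustration: the 3-d vectors are hand-made toy values (axes loosely meaning music, American, Japanese), not trained Paragraph Vectors, and the names `combine`, `cosine`, and `most_similar` are hypothetical helpers, not gensim's API.

```python
import math

# Toy, hand-made vectors (NOT trained output). Axes: [music, american, japanese].
doc_vectors = {
    "us_pop_singer": [1.0, 1.0, 0.0],
    "jp_pop_singer": [1.0, 0.0, 1.0],
    "us_history_article": [0.0, 1.0, 0.0],
    "jp_history_article": [0.0, 0.0, 1.0],
}
word_vectors = {
    "american": [0.0, 1.0, 0.0],
    "japanese": [0.0, 0.0, 1.0],
}


def combine(base, minus, plus):
    """Return base - minus + plus, element by element."""
    return [b - m + p for b, m, p in zip(base, minus, plus)]


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(query, candidates, exclude=()):
    """Rank candidate names by cosine similarity to query, best first."""
    scored = [
        (name, cosine(query, vec))
        for name, vec in candidates.items()
        if name not in exclude
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# "US pop singer" - "american" + "japanese", then look for the nearest document.
query = combine(
    doc_vectors["us_pop_singer"],
    word_vectors["american"],
    word_vectors["japanese"],
)
ranked = most_similar(query, doc_vectors, exclude={"us_pop_singer"})
for name, score in ranked:
    print(f"{name}: {score:.3f}")
```

With these toy values the top-ranked document is `jp_pop_singer`. With a real model the document and word vectors would come from training, and the neighbours would be noisier than this hand-built case suggests.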

[1] http://arxiv.org/abs/1405.4053

[2] http://arxiv.org/abs/1507.07998