| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by nschiefer 5082 days ago

You are correct. My algorithm is conceptually rather similar to a number of ones recently published. The work closest to mine in the literatures is [1], by Lafferty and Zhai at CMU in 2001.

That said, my method is somewhat different than these in the way that it explicitly treats unlinked documents as distributions over a graph of words and the theoretical framework (based on a theoretical process for document generation) employed to derive it.

[1] http://www.iro.umontreal.ca/~nie/IFT6255/lafferty-zhai.pdf