Hacker News new | ask | show | jobs
by nschiefer 5082 days ago
You are correct. My algorithm is conceptually rather similar to a number of ones recently published. The work closest to mine in the literatures is [1], by Lafferty and Zhai at CMU in 2001.

That said, my method is somewhat different than these in the way that it explicitly treats unlinked documents as distributions over a graph of words and the theoretical framework (based on a theoretical process for document generation) employed to derive it.

[1] http://www.iro.umontreal.ca/~nie/IFT6255/lafferty-zhai.pdf