Hacker News new | ask | show | jobs
by marginalia_nu 930 days ago
PageRank is. This is a modification of PageRank. The original algorithm calculates the eigenvector of the link graph.

This algorithm uses the same method to calculate an eigenvector in an embedding space based on the similarity of the incident vectors of the link graph.

1 comments

Hmmm, odd. I was under the impression they used cosine similarity based on page content. Once upon a time, based on that 'memory', I created a system to bin domain names into categories using cosine similarity. It worked surprising well.

Regardless, well done!

Hmm, seems like something that might be used for deduplication maybe?