Hacker News new | ask | show | jobs
by celerity 2823 days ago
I wonder if the Wayback Machine people are using a (potentially more modern) version of the AOPIC algorithm to decide what to archive. I wrote an article about that algorithm (which is similar to the original PageRank, but simpler IMO), and stated that a service "like the Wayback Machine would probably use something like AOPIC." It would be nice to remove that first like from the sentence!

[1] https://intoli.com/blog/aopic-algorithm/

1 comments

No, Heretrix doesn't really do any ranking as it crawls, it's all up to a cleverly chosen seed.

AOPIC looks to me like it's roughly the same as Yahoo's iterative pagerank algorithm, but I didn't look at it that carefully.