Hacker News new | ask | show | jobs
by is_true 616 days ago
maybe they are using commoncrawl, webarchive, yandex as indexes?
1 comments

In addition to those it's also possible they just found a website that published a scraped list back then and got de-indexed for obvious spammy content.

I would not be surprised if there are still some auto generated link directories left from the "golden ages" of blackhat.