Hacker News new | ask | show | jobs
by echelon 2500 days ago
This is awesome and brings back so many memories! Thanks a ton for doing this.

What methodology did you use to construct this backup?

As an aside, is there any chance whatever remnant of Yahoo still exists might have disks lying around from the Geocities days that weren't formatted? Do you think we could go about getting them?

1 comments

The sites are from the torrent and also from the archive team as well. I had to write some code that went through all sites and update the links. I also at the same time tried to just extract the html body and use that for indexing.... Yahoo! must still have the original sites. Surely they could just put them online as a "Read Only" version. They would have nothing to lose
> Yahoo! must still have the original sites.

Are you sure? It cost money to maintain hardware and infrastructure.