Hacker News new | ask | show | jobs
by tiffanyh 1290 days ago
Internet Archive.

Does anyone know how to recursively save his entire website to Wayback Machine?

I submitted https://chrisseaton.com, but it doesn't appear any of the child pages are being archived.

3 comments

If it helps any, Chris's web site used GitHub Pages and its source is available at https://github.com/chrisseaton/chrisseaton.github.io.
There's no particular guarantee that this will stick around either as Github/MS changes policies down the line. Some guaranteed long-term archive like archive.org would be better.
Right. I don't know if that GitHub account will be around forever, but you can clone it now and build the site. If it came to it, we could host on another domain. I'm just suggesting that we don't need to worry about archive.org getting every last bit of content.
Done. Will take some time for any straggler pages to show up in the CDX indexes.
It looks pretty well archived to me, where are you seeing things missing?