Hacker News
by
breck
653 days ago
I assume this is what wayback machine uses?
1 comment
Tomte
653 days ago
Of course not. They have their own crawler (Heritrix, an open source Java crawler) and archive in WARC format. It's serious archiving: they want to preserve response status codes, HTTP headers, etc.
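To make the point about status codes and headers concrete, here is a rough sketch of what a WARC `response` record looks like when assembled by hand. This is not Heritrix's code or its exact output; `build_warc_response` and the example URL are made up for illustration, and real crawlers write additional fields (digests, IP address, and so on).

```python
import uuid
from datetime import datetime, timezone


def build_warc_response(target_uri: str, raw_http_response: bytes) -> bytes:
    """Wrap a raw HTTP response (status line, headers, body) in a WARC/1.0 record.

    Hypothetical helper for illustration only.
    """
    warc_date = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    header_lines = [
        "WARC/1.0",
        "WARC-Type: response",
        f"WARC-Record-ID: <urn:uuid:{uuid.uuid4()}>",
        f"WARC-Date: {warc_date}",
        f"WARC-Target-URI: {target_uri}",
        "Content-Type: application/http; msgtype=response",
        f"Content-Length: {len(raw_http_response)}",
    ]
    # WARC headers end with a blank line; the record ends with two CRLFs.
    head = "\r\n".join(header_lines).encode("utf-8") + b"\r\n\r\n"
    return head + raw_http_response + b"\r\n\r\n"


# The block is the HTTP response exactly as received, so the 404 status
# line and the original headers are stored alongside the body.
raw = (
    b"HTTP/1.1 404 Not Found\r\n"
    b"Content-Type: text/html\r\n"
    b"Content-Length: 9\r\n"
    b"\r\n"
    b"Not here\n"
)
record = build_warc_response("http://example.com/missing", raw)
print(record.decode("utf-8"))
```

The key difference from saving just the page is that the whole HTTP exchange is kept verbatim, so even an error response is archived with its status line and headers.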