by mapgrep 5480 days ago
I just run this one-liner as an hourly cron job; Tumblr doesn't seem to mind. It gets all my images and other Tumblr-hosted media, too.

  httrack "http://myhost.tumblr.com" -O "/path/to/tumblr_backup" "+*.media.tumblr.com/*" "+*.myhost.tumblr.com/*" -N100 -I0 -c2 -%c2 --update
(You'll end up with index-2.html as your top-level index, so you may want an index.html symlink pointing at it. Add this line to the shell script above:

  ln -sf /path/to/tumblr_backup/index-2.html /path/to/tumblr_backup/index.html
...OR set up an equivalent alias in your web server config. The odd index name is an artifact of telling httrack to mirror the Tumblr paths exactly and not to build a generic httrack index file of its own.)
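
For anyone who wants both steps in one file, here is a rough sketch of a wrapper script. The httrack invocation is copied from the comment above; the function names, the `BACKUP_DIR` and `BLOG_HOST` variables, and the `run` argument are my own placeholders, not anything httrack or Tumblr defines. The link step uses `-f` so that repeated hourly runs replace the existing link instead of failing.

```shell
#!/bin/sh
# Hypothetical wrapper around the one-liner above; variable and function
# names are illustrative. Override the defaults via the environment.
BACKUP_DIR="${BACKUP_DIR:-/path/to/tumblr_backup}"
BLOG_HOST="${BLOG_HOST:-myhost.tumblr.com}"

# Same httrack call as in the comment: mirror the blog plus Tumblr-hosted
# media, keep the site's own paths, skip httrack's generated index, and
# update the existing mirror in place.
mirror_blog() {
  httrack "http://$BLOG_HOST" -O "$BACKUP_DIR" \
    "+*.media.tumblr.com/*" "+*.$BLOG_HOST/*" \
    -N100 -I0 -c2 -%c2 --update
}

# Point index.html at index-2.html. -f replaces a link left by an earlier
# run; the relative target keeps working if the directory is moved.
link_index() {
  [ -e "$BACKUP_DIR/index-2.html" ] || return 1
  ln -sf index-2.html "$BACKUP_DIR/index.html"
}

# Only do the work when called as "<script> run", so the functions can
# also be sourced and tried individually.
if [ "${1:-}" = "run" ]; then
  mirror_blog && link_index
fi
```

Saved as, say, /path/to/tumblr_backup.sh (a made-up path), an hourly crontab entry would look something like `0 * * * * /path/to/tumblr_backup.sh run`.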