I was browsing through dat for awhile, but haven't caught up with it lately:
https://github.com/maxogden/dat
Basically disk is so cheap that you should just keep 2 or 3 copies of your data around. And then you can sync them really quickly and do the processing on any one of N machines.
I was browsing through dat for awhile, but haven't caught up with it lately:
https://github.com/maxogden/dat
Basically disk is so cheap that you should just keep 2 or 3 copies of your data around. And then you can sync them really quickly and do the processing on any one of N machines.