Hacker News new | ask | show | jobs
by dist-epoch 948 days ago
Is there any backup software which continuously uploads your files to an AWS/GCP/Azure long term storage account that you control and pay for? Something like CrashPlan, which from time to time performs automatic maintenance and deletes old versions of files.
6 comments

Tons of choices: Restic, Borg, Duplicity, Kopia

Though Backblaze is pretty good at what it does and you can set your own encryption key, if that's the concern.

Unfortunately Backblaze requires[1] you to provide them this private key to restore.

[1] https://help.backblaze.com/hc/en-us/articles/360038171794-Wh...

Yeah I mean you gotta trust them on some level. Backblaze could also push a client update that nerfs the encryption. If you've got really sensitive data I'd probably pick something else.
What does restore mean this context? I’ve downloaded files from their online portal without providing a key. Perhaps restoring in this context means having them mail you a hard drive.
Really? I thought the file metadata is encrypted so it needs the key to even identify what files are available to restore.
You can always encrypt yourself and rclone to b2. Likely cheaper if you are not a data hoarder.
Perhaps this can help https://rclone.org/
Arq backup. I have used it before in the past(2-3 years ago) but not using it currently.
My experience with Arq has been terrible so far. The interface keeps glitching up, for a while it failed repeatedly until I gave it a unique permission in Windows services, and the backup process is far from intuitive, with multiple backups showing up as restore options, but each with different file sizes and specific files saved. Overall, just too many ambiguities for it to be reliable.

Oddly, SpiderOak has been a background go-to for years and always worked smoothly for backing up everything I select in a wholesale way, keeping a fairly clear record of what was saved, removed or moved, and adjusting immediately to any file changes or deletes I do. The SO interface is shitty and often freezes, and lacks many basic features like being able to see file sizes or scrolling through long file lists easily, but at least overall, I can quickly and easily see when backups are happening, how they're being done and what's being saved. Also, for restoring files, it's surprisingly fast despite a reputation held by many that it's slow.

Arq is the best set-and-forget option I know of for macOS.
I'm using rclone to sync with Backblaze nightly, executed directly from a cronjob.
Same. Rclone is wonderful because it supports a ton of different backends which makes it super easy to mirror. It's also got some great features like crypt where you can encrypt everything locally, thus sending the data all as ciphertext.
Backblaze software is pretty reasonable. But I have a linux machine, so restic and their B2 storage is a buck or two a month to backup a few computers in my house. (around 185GB of photos, etc)
>Backblaze software is pretty reasonable.

Hard disagree. The jankiness of their software was what made me cancel my sub after several years. Firstly it doesn't follow the OS date and number formatting. It's a minor thing, but it's so annoying having to parse the dumb M/D/Y format and comma thousands separator etc (being da-DK). It's not a deal breaker, but on the other hand, it's such a low-hanging thing that most other software gets right immediately.

But far more importantly, the BB client would some times just decide to re-upload several hundred gigabytes of data that I know for sure didn't change, which makes me wonder if it's just the client being retarded or if the data got lost server-side. And it takes absolutely forever to detect USB harddrives being plugged in. And its log files will grow to absurd sizes, and you're not allowed to purge them or the client will become brain damaged. And one time I needed to do a restore, it took literal days for BB to prepare it and I had to get support involved. I feel I just can't trust the BB stack, the client being the weakest link by far, and backup I can't trust is worthless.

You are probably correct. But there is a reason I put Backblaze client on my mom's laptop, rather than a scheduled powershell task to run restic to a cloud storage. And a scheduled task to run the prune weekly.
Define "continuously". Does the data need to be mirrored immediately upon write?