Hacker News new | ask | show | jobs
by iicc 2341 days ago
> We think it may be due to a hardware problem linked to the server RAM.

Are they using ECC RAM?

1 comments

Sounds like they didn't and the metadata logs got corrupted..

This does also mean that other data would be corrupted too, running ZFS without ECC RAM is frequently warned against.

Running any resilient storage system without ECC RAM is warned against, people just really make a big deal about it with ZFS. If your data in RAM is corrupted before it makes it to the hard drive, pretty much any file system is going to write corrupted data to the drive.
Indeed, ECC should be _default_ these days!

It becomes even worse when RAM Is usually only tested when the computer is built.

I've had several cases where RAM becomes faulty a couple of years down the road.

Recently I had a very weird case of two stick of four went bad due to only moving the computer from one corner to the next without even opening the case.