Hacker News new | ask | show | jobs
by vidarh 1919 days ago
I've seen highly correlated failures on regular hard-drives too. We had a large array of IBM DeathStars that failed approximately one every couple of weeks until the entire array had been replaced, for example.

But nothing like SSDs.

They absolutely can and do fail near simultaneously, but it doesn't even need to be with identical use. I've had multiple SSDs from the same batch fail the same week despite being in different arrays hosting different data, albeit similar usage patterns. If you're unlucky and get a bad firmware revision, suddenly you may face a cascade of failing drives before you have time to upgrade (I particularly remember a bad time dealing with failing OCZ SSDs...)

It's terrifying. My home NAS has four different brands for that reason. And of course I never trust a single array.

Dealing with storage has done more than anything else to make me worry about hardware risks... I really don't envy you running a storage service...

EDIT: IBM DeathStar refers to this, btw: https://en.m.wikipedia.org/wiki/Deskstar - see particularly the images. It was grim.

1 comments

> I've seen highly correlated failures on regular hard-drives too.

Yes, very mysterious too, until you discover that when someone made a big hole in the wall of the room containing the HDD storage bay and that the HDDs are covered by dust!

It was a looong time ago but I think I'll never forget opening the door and looking at the mess..