Hacker News new | ask | show | jobs
by Scramblejams 2810 days ago
The answer to the question you've asked depends heavily on the particular user and their work, of course. If the user is just playing games, who cares? But maybe they're doing financial calculations or compiling software that lots of users run or maybe they're doing aircraft structural analysis.

In any case, bit flips are much more common than were suspected: https://arstechnica.com/information-technology/2009/10/dram-...

I believe strongly that ECC should be standard, because you can't safely assume that your users are doing worthless work. Apple got this right on (non-Mini) desktops a long time ago. Not yet on laptops, unfortunately.

2 comments

If you’ve got files you care about then care about bit rot. I want ECC so my ZFS volumes don’t silently corrupt.
That study is faulty. The intern who did that study didn't know that Google would buy DRAM chips that failed manufacturers QA, but them on DIMMs themselves, and retest them at lower frequencies and with ECC turned on. When they already have to be tolerant of any node failing because of their scale, they can start playing fast and loose with this sort of thing if it makes financial sense.

EDIT: At -3 so far, does anyone want to explain the downvotes? I saw the google slides first hand, and there are comments from 2009 in that article saying the same thing.

Didn't down it either, but: "The intern" is a well-respected CS professor; the paper had two Google authors whom one would expect to have knowledge of Google's oddities. https://ai.google/research/pubs/pub35162

Your comment provided no substantiation of your claim, merely hand-waving, while casting aspersions on someone else's work.

All ram should be ecc. No one would accept this incorrectness BS if the precedent hadn’t been set by the monopolist.
It's really the memory manufacturers who are enforcing and profiting from the ECC shakedown though, AFAICS.
Intel probably gets more from selling a Xeon.
I didn't downvote, but I'd guess that they (a) want an authoritative reference to the story you're sharing, or (b) figure the point is moot because these errors still happen, even if less often than depicted by the study, and that ought to be enough to justify using ECC.

Like I said though, just a guess.