Hacker News new | ask | show | jobs
by monocasa 2810 days ago
That study is faulty. The intern who did that study didn't know that Google would buy DRAM chips that failed manufacturers QA, but them on DIMMs themselves, and retest them at lower frequencies and with ECC turned on. When they already have to be tolerant of any node failing because of their scale, they can start playing fast and loose with this sort of thing if it makes financial sense.

EDIT: At -3 so far, does anyone want to explain the downvotes? I saw the google slides first hand, and there are comments from 2009 in that article saying the same thing.

3 comments

Didn't down it either, but: "The intern" is a well-respected CS professor; the paper had two Google authors whom one would expect to have knowledge of Google's oddities. https://ai.google/research/pubs/pub35162

Your comment provided no substantiation of your claim, merely hand-waving, while casting aspersions on someone else's work.

All ram should be ecc. No one would accept this incorrectness BS if the precedent hadn’t been set by the monopolist.
It's really the memory manufacturers who are enforcing and profiting from the ECC shakedown though, AFAICS.
Intel probably gets more from selling a Xeon.
I didn't downvote, but I'd guess that they (a) want an authoritative reference to the story you're sharing, or (b) figure the point is moot because these errors still happen, even if less often than depicted by the study, and that ought to be enough to justify using ECC.

Like I said though, just a guess.