|
|
|
|
|
by endorphone
2304 days ago
|
|
"In practice, I don't know anyone who'd willfully take 10-15% perf hit on GPUs, because of a cosmic ray." Virtually every server in data centers runs on ECC: the notion of not using it is simply absurd. And given that the Tesla V100 gets 900GB/s of memory bandwidth with ECC, versus 616GB/s of memory bandwidth on the 2080Ti without ECC, it's a strawman to begin with. nvidia further states that there is zero performance penalty for ECC. As to whether the requirement is "real", Google did an analysis where they found their ECC memory corrected a bit error every 14 to 40 hours per gigabit. "That's about it." Also ECC memory. Also dramatically higher double precision performance. Dramatically higher tensor performance. Aside from all of that...that's it. |
|