by solardev 703 days ago
Is it normal for configuration to be able to override hardware thermal protections?
3 comments

Yes, if the target market is overclockers. They want to be able to override everything for a high score. My board (ASUS TRX50) has all kinds of override settings for fan speeds, voltages, TDP (whatever that does!) and a warning not to mess with them if you don't know what you're doing.
Yes, unfortunately. When you buy "enthusiast boards", which is everything that Dell, HP, etc. don't ship these days, you have literally no idea what crappy BIOS and software configuration you are inheriting.
yes, even W680 can override power and thermal limits, voltage, current excursion protection, etc. Everything except clock multiplier.

https://youtu.be/5KHCLBqRrnY?t=2694

that is part of the problem, W680 is not the same thing as C266 (and even C266 might be able to do it, wendell is sounding concerned about E-2400 platform too). W680 is still a consumer-socket product, it's just one that supports ECC. Like yes, people run those in a datacenter and that's fine and normal and supported - some customers want high single-threaded performance, and the big server chips just aren't as good at that. One of the affected customers is Citadel, which is unsurprising if you think about it (HFT).

this also means you get fun stuff like 13700T sometimes being run without power limits... but even within power limits they've seen 13700T degrading too, which is kind of a point against the whole "their hubris and power consumption angered the gods" thesis. If 35W is too much power, we're all cooked.

But it's hard to say, since nothing is being run within-spec and you have to bend over backwards to get "stock" behavior etc. Buildzoid has elaborated on and clarified this (after a couple of initial videos that were working from incomplete info). And like yeah, that's a whole shitshow too... partners were severely breaking the spec in a whole bunch of places, both by departing further from the spec in ways that could cause problems and by performing a factory undervolt out-of-the-box that isn't necessarily stable, and this has gotten more and more out-of-spec over time too (both the undervolting and loadline). Also, the "intel baseline profile" and "intel failsafe profile" apparently did not come from Intel, those were made up by gigabyte and msi, while the Intel Default profile did. Great stuff, you love to see it. /s

https://www.youtube.com/watch?v=eUzbNNhECp4

https://www.youtube.com/watch?v=k6pUZs_tuJo

But there just has to be a reason that only 10-25% of samples are affected: if it were just generically power or current, you would see it everywhere. Hence why board config is/was a concern, and why GN is now kinda pointing the finger at this "contamination/oxidation of the vias" fab problem theory.