|
|
|
|
|
by _jal
1520 days ago
|
|
I don't see this. I have thousands of long-lived instances - full VMs, not containers, running in our hardware. If they start "going bad", something is wrong. That's a signal I wouldn't want to ignore. It has happened - once an HBA in a storage node was causing occasional corruption, another time due to a communication failure people were building things with the wrong version of something which had a memory leak and would eventually summon the OOM killer. There have been other issues. "Have you tried turning it off and back on again" is still a terrible system management strategy. |
|