Hacker News new | ask | show | jobs
by bradfitz 2351 days ago
Actually, a good BMC lets you automate your fleet of machines and treat them like cattle, rebooting/reimaging on demand.
2 comments

> a good BMC

I hope to see such a thing some day. The code quality and support lifecycles for these things are horrendous.

Cattle is when you DGAF that 1% of your machines failed to reboot. A tech can deal with it whenever.
How are you rebooting your cattle though if not via the BMC?
More importantly, how do those cattle become cattle in the first place without BMC? How do you turn them into another breed of cattle without BMC?

Not that it can't be done... but that's a system dependent on the hardware doing the right thing, as opposed to a system that can tell the hardware what to do every step of the way.

There are other solutions, like network pdus. It’s different but does the same job.
I'm also curious how they're reimaging those machines.
What's wrong with PXE boot?
How are you triggering the machine to PXE boot? How are you recovering from situations where the machine becomes unresponsive or needs someone to look at a console to see why a failure occurred?
PXE boot it every single time and if it fails it goes on the repair list and some tech looks at it within 90 days and I DGAF about one dead cow.