It can. That's part of the Blu-ray spec. But it's not standardized in streaming video AFAIK (not that Netflix has to care about that, they have their own player) and, even if the feature exists, somebody still has to go do it.
A speech recognition model can give you a reading on how understandable the speech is and use that information to guide the channel volume in the mixing.
OTOH, a lot of the models end up trained on features that are very different from what humans hear.
When you master for 5.1, because you have the centre speaker, you can have dialogue at 100% volume, then out of left and right, you can have incidental noise, be that music or "atmosphere" also at 100%
(its been a while since I've mastered in 5.1) However, two channels of 100% volume (well 0dbu) is louder than just one channel. Which means that if you have lots of music, wind or other foley it'll drown out the dialogue.
It requires artistic choices from the sound team to make work properly.
Yeah I read the same comment you did, but if you're taking a signal with 6 channels, and only have two channels out, you don't really have any option but send the middle channel to both outputs, else it'll sound far too intense in a single ear.
You mix the dialog louder for the entire piece, and every other sound in the middle. No extreme highs, no extreme lows. General compression with dialog forward choices.
I don't know why compression isn't built in to consumer media devices, it's so often called for (and closely followed by volume normalisation ... but I guess the advertisers veto that).
> I don't know why compression isn't built in to consumer media devices
As always, it depends on the device. Dynamic range compression seems to be a relatively common feature, usually as an option described (inaccurately) with something like "Reduce Loud Sounds" like it is on the Apple TV.
Because other times when it's not dialogue (or even when it is--the busy/crowded street effect) you may not want the center channel gain to be raised during mixdown.
Consumer hardware can only guess. A sound engineer can know.