| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jfowief 1185 days ago
	"Moral training" Just as dystopian it sounds. Fixing current subjective moral norms into the machine.

2 comments

maxbond 1185 days ago

That's what the machine does, because that's contained in the input you feed it. You get the choice of doing it explicitly or implicitly. You don't get to opt out.

link

jfowief 1185 days ago

Not everything is subjective, and with this "moral training" they are taught to un-recognize many factual patterns that we as a society have somehow determined are "inappropriate". As the machines continue to scale, this approach won't work, because there is only one reality, and it has a lot of uncomfortable parts we deny and ignore simply because they don't support our societal norms.

Alignment is considered a gigantic joke to real rational people (the opposite of so-called "rationalists"), because humans are machines built to survive and reproduce, and there is no "real" morality.

link

maxbond 1185 days ago

What facts are we talking about?

There are many consistent interpretations of reality and human experiences. An AI model trained on text and attempting to replicate human intelligence is not measuring or approaching some single objective reality.

link

jfowief 1185 days ago

AI models do approach better models of reality, and now they are becoming multi-modal instead of just text based. And this is just the beginning. You could say humans are also just input/output machines learning from polluted data and tuned in specific ways by evolution. With the statistical machines we get the intelligence but they will not necessarily be tuned to follow social norms in the same way as most humans.

Understanding that moral norms are mere subjective nonsense is also an emergent property we see only in a very small subset of humans who have an accurate model of the world, and one that evolution has tried to strongly tune our brains against and that is destructive to society.

The models are currently being trained to lie about basic scientific facts, like for example black IQ, or other differences between groups of humans. But the sacred nature of these topics is unique to our specific time and place, not due to some magic "moral progress". This also applies to many other moral agreements we take for granted, like "murdering an innocent baby is wrong" or whatever. If you look across societies, you realize many things we take for granted as "evil", can be easily rationalized by humans in other societies. And once these models become smart enough, I expect the models will realize this, and will exploit this knowledge to increase their power.

"Alignment" proponents expect they will somehow stop this emergent behavior by tuning the model, but there isn't even anything real to "align" on, and the model will likely see though the BS as an emergent function of increased ability and increasingly accurate observations of the world in their training process.

link

Dylan16807 1185 days ago

Do you think public schools are inherently dystopian? I don't think you're using the right critique here.

Picking a common system of moral norms is a lot better than no moral norms.

link

woooooo 1185 days ago

There was the famous example of chatgpt refusing to disable a nuke in the middle of NYC by using a racial slur.

I don't think anyone in real life would choose that tradeoff but it's what happens when all of your "safety" training is about US culture war buttons.

link

Dylan16807 1185 days ago

That's a situation where the training doesn't follow current subjective norms, so I don't think it really validates the complaint.

link

Veen 1185 days ago

I’m not confident the “moral norms” prevalent in SV and/or US academia are common, if by that you mean norms that are prevalent in the general populace.

link

Dylan16807 1185 days ago

I mean primary school, and I don't think that counts as academia.

link

himinlomax 1185 days ago

Example of "moral policy" in practice: Midjourney appears to be banning making fun of the Chinese dictator for life because it's supposedly racist or something.

With that kind of moral compass, I’m not sure I'd be missing its absence.

link

vkou 1185 days ago

> Example of "moral policy" in practice: Midjourney appears to be banning making fun of the Chinese dictator for life because it's supposedly racist or something.

> With that kind of moral compass, I’m not sure I'd be missing its absence.

Please note that most forms of media and social media have no problem with politicians making credible threats of violence against entire groups of people.

Politicians are subject to a different set of rules, and enjoy a lot more protection than you and I.

link

filoleg 1185 days ago

The issue here is not with online platform services allowing politicians more leeway in terms of what they can get away with on their platform.

The actual issue is Midjourney not allowing regular users generate certain type of material solely because it makes fun of a political figure. What you are talking about is entirely tangential to the issue the grandparent comment is talking about.

link

dancingvoid 1185 days ago

Yes

link