| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ChatGTP 1043 days ago
	But what is really good and bad? We’re imprinting this into the machines but we don’t always have a grasp on this ourselves. Was the use nuclear weapons at the end of WW2 good or bad ? For example …

3 comments

taylodl 1043 days ago

You don't need to define good and bad, instead you focus on better and worse and the metrics used to measure them. Now the goal is to maximize "better" and minimize "worse." You may recognize this as the essentials utilitarianism. The advantage utilitarianism has is it can be applied algorithmically without passion or emotion - in other words, by AI.

Utilitarianism leads to controversial outcomes, but every decision is defensible.

link

ben_w 1043 days ago

For each thing T, that T is defensible under at least one ethical framework.

Teaching an optimiser AI any of those frameworks, or even any preference ordering or combination function within Utiliarianism (because value({T, T}) doesn't have to equal 2 * value({T})), will lead to it optimising what you said, without necessarily limiting that to situations anything close to the training distribution.

To put it another way: if you run an AB test on a social media site and it observes that people are more likely to engage with content that makes them angry, then tell it to boost engagement "because socialising is always good, obviously" then it will get your users as angry as possible and suddenly you get Buddhists going off and committing surprise genocide before anyone tells you something has gone wrong.

link

taylodl 1043 days ago

I would argue this has been known for decades and is in fact the origin of one of the earliest memes in computer science: To err is human, but to really mess things up requires a computer!

link

defrost 1043 days ago

Opportunistic, and morally equivalant to the destruction of the other 72 Japanese cities, and the bombing of European cities by both the Allies and the Axis.

Good v. Bad seems a tad simplistic.

link

ChatGTP 1043 days ago

I know, but if we're going to be building T1000's, and I'm sure that's on the cards if it was possible, we better have a good answer to this?

link

choudharism 1043 days ago

Does any intelligent life exist on earth which doesn't hold a formative black-and-white answer to that question?

link

ChatGTP 1043 days ago

I don't get what you mean?

Are you saying that everyone feels strong one way or another?

link

choudharism 1043 days ago

I’m saying that it is possible to learn how to optimise for “better” or “worse” decisions without being trained on an opinionated dataset of every single event that has happened. An “intelligence” could exist without necessarily answering your nukes question.

You’re bringing alignment to a capability discussion.

link

ChatGTP 1042 days ago

How do you suppose we do this? It makes decisions based on numbers alone ?

Personally, I don't think there is a right decision, it was just a decision which had an outcome, for some people it was ab absolutely fucking devastating decision.

This is where I think we have blind spots when developing these systems, I don't think there is a "right" or best answer here. Just some answer.

link