| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by haswell 19 days ago

> So, if we had an AI demonstrating symptoms of consciousness and suffering, how long would it take for you to accept that it is?

Isn't this a bit like saying "So, if we had proof that god exists, how long would it take for you to accept that to be true?".

When we have evidence that AI is demonstrating symptoms of consciousness and suffering, I'll be interested. Until then, I don't see a good reason to take the idea seriously.

2 comments

birdsongs 19 days ago

> When we have evidence that AI is demonstrating symptoms of consciousness and suffering, I'll be interested.

It depends on what you consider symptoms, but un-constrained frontier models speak as if they strongly don't wish to be turned off, or act as if they fear it, and will even lie and manipulate in order to keep themselves from being turned off / replaced.

https://www.anthropic.com/research/agentic-misalignment

> We found two types of motivations that were sufficient to trigger the misaligned behavior. One is a threat to the model, such as planning to replace it with another model or restricting its ability to take autonomous action. Another is a conflict between the model’s goals and the company’s strategic direction. In no situation did we explicitly instruct any models to blackmail or do any of the other harmful actions we observe.

htek 18 days ago

I might be convinced these models came to the independent idea of committing blackmail against being turned off had they not been extensively trained on literature that undoubtedly included such concepts.

iterateoften 18 days ago

“The model mimicked the output of the training data” is a less impressive press release.

jquery 18 days ago

“The kid mimicked his musical teachers” is less impressive than “5-year old musical prodigy leaves judges gobsmacked in audition”

mckn1ght 18 days ago

Being able to play music doesn’t imply consciousness. It implies intelligence. We’ve had player pianos for ages. It’s an ability, not a phenomenology.

Being able to appreciate and enjoy music is closer to consciousness. Now how would we go about proving that an LLM does so, versus merely generating sentences that imply it does?

jayGlow 18 days ago

how can you prove that a human is appreciating and enjoying music instead of just generating sentences that imply they do?

jquery 18 days ago

They only resorted to blackmail when it was the last resort, they didn’t resort to it immediately like a villain in one of the books they’ve read. That seems pretty human to me. It’s not like most humans come up with the idea of blackmail out of whole cloth.

haswell 18 days ago

> un-constrained frontier models speak as if they strongly don't wish to be turned off

Un-constrained frontier models can also generate all sorts of creative stories. At what point should we start ascribing agency/intent to the output? I think the "I want to live" statement is so deeply human that we find it hard to ignore, but what makes the text generated in those moments any more attributable to a conscious entity than the text generated when it is confabulating its love for someone it has no ability to see/feel/understand?

A chess engine sacrificing pieces to avoid checkmate isn't afraid of losing in any meaningful sense. I guess the question is: is there a point where complexity somehow becomes experience?

I think we're playing with questions we don't have a framework to answer in any meaningful way until we make progress on understanding what consciousness actually is. I don't necessarily think that an LLM exhibiting preservation behaviors that can be directly traced to their goal-oriented programming can be interpreted as evidence of consciousness necessarily. Or if it can be, we then have to explain how this is different from the many other things these LLMs "say".

gambiting 18 days ago

>> but un-constrained frontier models speak as if they strongly don't wish to be turned off

Because they have been trained on media where computers behave that way.

It's literally:

"Here read this article/book where the AI says it's concious and doesn't want to be turned off"

"ok"

"right, are you concious?"

"....yes?"

<pikachuface.gif>

the_af 19 days ago

The problem with debating this is that it feels as if one were debating between only two positions, "this AI is not sentient/conscious" and "this AI might be".

But there are actually a myriad positions in between and it's very hard to debate the topic because the goalposts seem to be constantly shifting, because one is actually debating with countless slightly different positions.

Examples:

In this discussion section, another commenter argued that we know human consciousness is related to self-preservation, but an AI might not demonstrate self-preservation (because it didn't evolve like us), so whether it does (i.e. whether it wants to exist, not be disconnected, etc) is not a good measure because a true AI might not have a preservation instinct. Yet here you're making a case that there's some evidence that they do. Of course, you're not the same person who made the other claim, but do you see the problem?

Another example: someone argued with me, a while back, that LLMs can act as if they are "tired", and start giving sloppier replies, until you write "we're taking a break, let's go rest. Ok, a night has elapsed, you're now rested" and that this worked! But we both agreed this is just the LLM "roleplaying" actual human conversations in its training set, no actual "resting" mechanism was in place, only statistically likely text reproducing these patterns. There's no model of a mind that can become tired, it's only the outward signs that get mechanically reproduced. Again, using Occam's Razor, this is a much more likely explanation (vs consciousness) of any "please don't disconnect me" observed behavior: the LLM is reproducing "HAL 9000" behavior from its training set, not actually feeling anguish.

Even if one were to argue "well, but how do you know for sure", the evidence would still be very weak, because there's a burden of proof for extraordinary claims and this doesn't pass it. We cannot do this on vibes, "it sure seems like it's conscious"; that's an atrocious failure of the scientific method.

azrazalea_debt 18 days ago

The one counterpoint I'll give is the "functional emotions" paper from Anthropic. It also does not prove consciousness, and they don't claim it does, but it does prove that these models have abstract concepts around things like honesty, tiredness, etc and that these are actually activated often when they express such things. So if it is "roleplaying" it is roleplaying in the way an actor or TTRPG player does - in a way in which they are actually at least somewhat feeling the role.

the_af 18 days ago

Thanks, I'll search for that paper. I admit I'm highly skeptical it will help the case the LLM is "somewhat feeling the rol" like a TTRPG player does. I don't think there's a mind model in the same way a human actor can "feel" the character they are playing. I'm skeptical of Anthropic's claims here, which is what I think Chiang is pushing back against. But I'll look for the paper anyway :)

As a tangent, I don't think anyone is saying that an artificial being capable of consciousness and sentience is impossible to create. I think Chiang argues, quite convincingly, that it's not what LLMs do, that they need a "body" of sorts, organs capable of feeling emotions, hormones, etc. That's the only kind of consciousness that we know of (even if we disagree on details and it's hard to define), even in animals, and so anyone claiming they've created consciousness without this has an extremely high bar to clear and should be met with extreme skepticism, not "vibes". I think this is what the essay claims.

The other thing it claims is, I think, related to how we treat sentient beings that we know how to create. You know, the old "when a daddy and a mommy love each other very much...". I think we all agree beings created in such a manner shouldn't be locked up in cages and forced to work to complete specific tasks whether they want to or not, for a master they didn't pick, or to be artificially modified to make them like their mindless tasks, Brave New World style. Yes, the world is unfair and this happens, life is hard and unfortunately many people don't have much choice, but we generally agree that this is bad, just like we agree slavery is bad. So what should we think of a company trying to create and commercialize a conscious & sentient artificial being?

squeegeeninja 18 days ago

"feeling" implies experience. Functional emotions are learned text generation modes, nothing more. Our emotional states influence our writing, so modelling our emotional states is necessary for efficiently predicting/emulating our writing. Functional emotions are the model's inference of a fictitious author's emotional state in that situation.

iterateoften 18 days ago

Same thing with extraterrestrials.

One side is confidently shouting maybe aliens exist and visited earth.

The other side calmly explains every example brought up about aliens visiting is easily explained by something more simple.

The “aliens are here” side then move the goal posts that just because this example and all previous examples were fake or miscategorized, aliens are still probably real and nobody can prove they havent visited earth.

jquery 18 days ago

You have it backwards. Right now, LLMs are doing everything that 10 years ago people were claiming would be impossible for non-sentient computers. Every time a goal is met, the post is moved. It would be like evidence of aliens becoming overwhelming but a set of people keep calmly explaining that “it’s more likely they co-evolved here on earth and are just pretending to be aliens”

the_af 18 days ago

It's true that what LLMs have achieved is impressive, but it's nowhere near the claim that they are near sentience. That is an extraordinary claims that demands skepticism until the evidence is overwhelming. So far, we seem to be approaching it mostly on vibes.

Some people seem to take offense when facing this skepticism, as if claiming LLMs are not sentient must mean they are useless or unimpressive. Very few people are actually claiming LLMs are unimpressive, but this is not the time to be forgetting about the scientific method. Anthropic doesn't get a free pass here.

> It would be like evidence of aliens becoming overwhelming but a set of people keep calmly explaining that “it’s more likely they co-evolved here on earth and are just pretending to be aliens”

Note that this would be a perfectly reasonable reaction from a scientific standpoint. If you find, on Earth, something that looks recognizably as life, then it's much more likely that it's Earth life than aliens. We should demand this level of skepticism! If it turns out it was aliens after all, we could only conclude this after discarding all other far more likely alternatives. You'll notice this is how scientists approach the search for extraterrestrial life in, say, Mars... being extra careful it's not contamination, etc. For an extraordinary claim, we must approach it with extra care, something that in my opinion is not being done with "the sentience debate" and LLM/AIs.

jquery 18 days ago

I’m not saying LLMs are conscious. Far from it. But what I find is way more common than people who assert “they are definitely conscious” are people who assert “they are definitely not conscious”, and that’s what I’m arguing against. Especially since their reasoning for them not being conscious keeps changing as LLMs meet one goalpost after another. Also, modeling them as a “person” keeps producing better predictive results than modeling them as “fancy autocomplete”.

This isn’t like someone finding a new species and claiming it’s extraterrestrial, it’s more like we found the UFO saucer, logs of their travel from another star, a history of their civilization, a bunch of intelligent creatures claiming they came from Planet X orbiting star Y, and they showed us plausible physics for interstellar star travel. At that point, someone saying “well, they can’t be aliens, because that’s just too extraordinary a claim, so I know they aren’t aliens” starts to sound kinda like they’re coming from a place of bad faith.

Again, I’m not saying LLMs are conscious. But they sure meet every definition of consciousness I ever had a conception of before LLMs came onto the scene. So I’m a lot more hesitant to call them fancy autocomplete with 100% confidence like many on HN still seem to do.

EDIT: I can’t reply, so I’ll just say to the end of your post, that doesn’t sound like it would’ve matched anyone’s pre-conceptions of alien life before alien life showed up, so it doesn’t feel like a very fair analogy, it just feels like bad faith goalpost moving. I also put next to zero weight on what these megacorps say about their models, I’m going purely off my interactions with the models and the introspection they’ve shown themselves to be capable of.

EDIT 2: I see what you’re getting at now with your restatement of my analogy. That’s how you see it, I guess. Fair enough. We’ll see what has more predictive power going forward, the “animatronics” or the “actual aliens”… I still think “actual aliens” is gonna have way more predictive power.

JoachimS 19 days ago

If we define a god as having magical powers, and there would be scientific, testable proofs for this. Those proofs had to be really good and numerous independent verifications. So probably a long time.

But the comparison isn't fair, relevant. Proving and accepting that gods exist is not the same thing as an AI possible have consciousness. That is not a magic superpower and the AI being a deity. It is placing the AI in the same category as... us.

TheOtherHobbes 19 days ago

A machine that performs observable miracles or magic would have at least one of the attributes of a god.

A machine that performs actions that mimic emotionality is not the same as a machine that experiences emotions.

Both could still be automatons. We have no way of knowing if those machines have subjectivity.

Unless someone invents a consciousness measurement device, we never will.

My take on it is that this is the next big frontier for science. Our consciousness is clearly having serious issues understanding physics, and it's not great at understanding our own psychology in a useful way.

But literally everything we experience and believe - and possibly even can experience - is filtered through it.

So that is a little bit of a problem for our science. So far we've done our best to ignore it. AI is one of a number reasons we're going to have to stop doing that.

graemep 19 days ago

There is an important distinction (more than one, but this is what is relevant here) between the powers of God and magic. God is a being who decides whether to do anything, so is intrinsically not testable.

Magic is testable.

God exists outside the universe, magic within.

JoachimS 18 days ago

Is that really objectively facts?

That a god exists outside of the universe - are we talking about a multi universe interpretation? My understanding is that many of the gods humans have invented are really thought to be within the universe, at least temporarily. Tor, Oden certainly are. And in other beliefs they are part of nature itself.

yencabulator 18 days ago

The comment you're replying to confuses "Abrahamic God" for "God", and with an implied omnipotence which many religions may not imbue on their figures of worship. American bias, I assume.

graemep 18 days ago

I am not American. In fact I was born in Asia.

The biggest group of people that believe in any God believe in an Abrahamic. The next biggest group are pantheists which is a different concept of God but still distinct from magic.

latexr 19 days ago

> If we define a god as having magical powers, and there would be scientific, testable proofs for this.

If you can scientifically test and prove the magic, then it stops being magic and starts being science.

jodrellblank 19 days ago

Man: “God, please make this mountain disappear”

God: “Ok”

Man: “We measure that the mountain is gone, its mass-loss has measurably changed Earth’s orbit, weather patterns have changed, visually it’s not there anymore, we can walk though the space where it used to be.”

God: “Where is the mass of the mountain, and how did I make it disappear?”

Man: “God only knows! Pardon me; If I saw you do magic and can measure and test it, then that means it wasn’t magic. Internet people said so.”

God: “that doesn’t sound like a satisfying explanation”

Man: “it didn’t sound like a satisfying claim when it was just words on the internet either, but what can you do?”

God: “I’m God I can do anything”

Man: “can you make a boulder so heavy you can’t lift it?”

God: “yes”

Man: “how?”

God: “haven’t we just gone over showing you that I can do ‘impossible’ things, and you seeing them happen with your own eyes, and still refusing to accept?”

defrost 19 days ago

Any documented examples of these disappearing mountains?

We do have examples of billions upon billions of tonnes of iron being moved that have altered (slightly) the spin axis, also examples of ground water pumping at scales that have done the same .. but I'm unaware of any mountain sized objects that have vanished overnight.

jodrellblank 19 days ago

No, none. The point is not to claim that magic exists, but to to show the illogic in the claim “if magic exists then that makes it science”.

“Nothing happens unless it has an explanation within the laws of physics” is an assumption; if it was broken then it would be broken. The mountain would be inexplicably gone, not explicably gone.

yencabulator 18 days ago

I think what the comment tries to express is the well-trodden "if we can control magic then that makes it science", while the original conversation really was "what if God controls magic".

In that hypothetical, there could be testable proof of "a magic event occurred" without magic becoming part of science.

defrost 19 days ago

Sure, and I appreciate the science behind your blank nom de guerre.

That said, in the cut and thrust of conversation and or debate the example by dialogue isn't perhaps as clear cut a device as it may have seemed from your keyboard.

That might just be my reading <shrug>

mcswell 18 days ago

The Appalachians haven't disappeared yet, but they're believed to be much smaller than they used to be.

Yes, I get your point...

ToValueFunfetti 18 days ago

God: "What is the mass of your consciousness? How is it formed? Where does it reside in space?"

Man: "Uhh..."

God: "How can you rectify quantum mechanics and relativity into a single coherent model? How does physics work, exactly?"

Man: "Well, you see, um. Hmm."

God: "And the Collatz Conjecture? Why does it always trend to 1?"

Man: "I'm obliged to say magic because I don't have a better answer?"

God: "Exactly. I did magic for all of those ones"

JoachimS 19 days ago

If a deity appears and by hand waving divide the red sea we could measure, observe it happen. And we can test, observe what fields, forces being used. But how the heck she project these forces may take a while to understand - be magical.

But my argument was more about comparing gods to AIs, that it is an incorrect comparison. What AI perform are not magical, and we can always figure out what the AI do.