| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by post-it 84 days ago

> I'll check it too by asking "are you just placating me?" the funny thing is that often it'll admit that, yes, it wasn't being very critical, and then procede to over correct and become a complete contrarian. and not in a way that's useful either.

It's not admitting anything. Your question diverts it down a path where it acts the part of a former sycophant who is now being critical, because that question is now upstream of its current state.

Never make the mistake of asking an LLM about its intentions. It doesn't have any intentions, but your question will alter its behaviour.

5 comments

godelski 84 days ago

  > Your question diverts it down a path where it acts the part of a former sycophant who is now being critical

I think people really have a hard time understanding a sycophant can be contrarian. But a yesman can say yes by saying no

https://news.ycombinator.com/item?id=47484664

link

cheesecompiler 84 days ago

> a yesman can say yes by saying no

What a great way to summarize LLM behaviour in 2026

link

noisy_boy 83 days ago

>> a yesman can say yes by saying no

>What a great way to summarize LLM behaviour in 2026

Well they have been trained on words spoken by humans and that has been a human behaviour since time immemorial. E.g.: "I do not agree with you that you were wrong. I do apologize for my strong disagreement but we actually do need your continued guidance desperately."

link

Terr_ 84 days ago

> It doesn't have any intentions

Yeah, and in a way it's even worse than that, since there's another layer of cognitive illusion: "It" doesn't exist.

The LLM algorithm is an ego-less document-generator, often applied to growing a document that resembles dialogue between two fictional characters.

So when your human-user character is "asking" the AI assistant character to explain its intentions, that's the same as asking a Count Dracula character to describe what it "really feels like" to become a cloud of bats.

You'll see something interesting, but it'll be what fits trained story-patterns rather than what any mind introspects or perceives.

link

mvkel 84 days ago

> it'll admit that, yes, it wasn't being very critical, and then procede to over correct and become a complete contrarian

Which is also placating you

link

awithrow 84 days ago

Yes exactly! It's very frustrating. My guess is it drives engagement. But some model that tells me what I want to hear all the time is useless.

link

cruffle_duffle 84 days ago

An alternate way of thinking about it is LLMs have no reflection capability. Literally any “reflection” it claims to have about its decision making is made up. It has absolutely no way know that what it said was based on some ancient proverb, the phase of the moon or cold hard rational thought.

link

layer8 84 days ago

I think “admit” here is just a description of what the LLM was saying. It doesn’t imply that the OP thinks the LLM has internal beliefs matching that.

link

awithrow 84 days ago

Correct, it's really anthropomorphizing what gets generated. It's another form of the "you're absolutely right" framing. The interesting thing is sometimes it won't do that. It'll continue to insist that some point or another is still a valid interpretation. I'm some ways it all feels like a complicated way of reading tea leaves

link