| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by square_usual 52 days ago
	Opus 4.8 has made huge jumps in being less sycophantic. I see it pushing back on ideas a lot, and that's very helpful when you're evaluating options.

2 comments

lachlan_gray 52 days ago

Almost too much so, it often feels like opus is pushing back for the sake of pushing back. The way old models used to add disclaimers to every message regardless of content

link

NewJazz 52 days ago

That's because it can't literally reason, it has just been manually steered into those reasoning speech cycles.

link

emodendroket 52 days ago

Yes, yes. Does everyone still find it interesting to go over this point every time about how it's not literally a person with human reasoning?

link

NewJazz 52 days ago

Uh, only when people don't seem to understand it, or try to personify it. Which is quite often.

link

CamperBob2 52 days ago

What about when they ask how you can take gold at IMO and solve research-level math problems without reasoning?

link

emodendroket 52 days ago

People “personify” their cars but I don’t think because they think cars have human cognition

link

lmm 52 days ago

People are weird about their cars and make major errors in judgement as a result (e.g. we tolerate incredibly high rates of people getting killed because they were "hit by a car", as though the driver had nothing to do with it). Pushing back on that is absolutely worthwhile.

Not in the same way.

It’s more like Opus wants you to do its job for it. I feel that amount of time when I tell it “no, you do that” increases with each new version.

link

fragmede 51 days ago

It was mind blowing the first time I got a refusal, and retorted "yes you can" and had that work, but now it's just another reason to move to a different model.

link