Hacker News
by square_usual 5 days ago
Opus 4.8 has made huge jumps in being less sycophantic. I see it pushing back on ideas a lot, and that's very helpful when you're evaluating options.
2 comments

Almost too much so. It often feels like Opus is pushing back for the sake of pushing back, the way old models used to add disclaimers to every message regardless of content.
That's because it can't literally reason, it has just been manually steered into those reasoning speech cycles.
Yes, yes. Does everyone still find it interesting to go over this point every time about how it's not literally a person with human reasoning?
Uh, only when people don't seem to understand it, or try to personify it. Which is quite often.
What about when they ask how you can take gold at IMO and solve research-level math problems without reasoning?
People “personify” their cars, but I don’t think it's because they think cars have human cognition.
People are weird about their cars and make major errors in judgement as a result (e.g. we tolerate incredibly high rates of people getting killed because they were "hit by a car", as though the driver had nothing to do with it). Pushing back on that is absolutely worthwhile.
Not in the same way.
It’s more like Opus wants you to do its job for it. I feel that the number of times I have to tell it “no, you do that” increases with each new version.
It was mind blowing the first time I got a refusal, and retorted "yes you can" and had that work, but now it's just another reason to move to a different model.