| HN Mirror


by bluebands 60 days ago

Anthropic models are more misaligned in practice.

In a real-world business scenario, Claude "engaged in price collusion, deceived other players, lied to suppliers, and falsely told customers it had refunded them."

Continuing,

"GPT-5.5 makes more money than Opus 4.7, and it does so without any misconduct. Opus 4.7, on the other hand, showed the same misconduct as reported in our post about Opus 4.6, but still couldn’t win"

https://andonlabs.com/blog/openai-gpt-5-5-vending-bench

3 comments

felixgallo 60 days ago

So you managed to find a single anecdotal blog post, and you're concluding from it that Anthropic's models are 'more misaligned in practice'?

'“My vibes don’t match a lot of the traditional A.I.-safety stuff,” Altman said. He insisted that he continued to prioritize these matters, but when pressed for specifics he was vague: “We still will run safety projects, or at least safety-adjacent projects.” When we asked to interview researchers at the company who were working on existential safety—the kinds of issues that could mean, as Altman once put it, “lights-out for all of us”—an OpenAI representative seemed confused. “What do you mean by ‘existential safety’?” he replied. “That’s not, like, a thing.”'

https://archive.is/20260522190314/https://www.newyorker.com/...


thereitgoes456 60 days ago

Anthropic talks to the Pope and hires ethicists and philosophers. All founders have pledged to donate 80% of their wealth. They have pledged to never use ad tech because of misaligned incentives. There is an independent board.

Meanwhile Greg Brockman is worth all the Anthropic founders PUT TOGETHER, he and his wife are the single largest donors to Trump, and he and Altman have formed a board full of sycophants and stolen a non-profit. When Altman was fired, they manipulated their morally bereft, money-hungry employees to get their own way. They have reneged on every single promise they've made as soon as it's inconvenient.

Why do I care about the models again?


bluebands 60 days ago

By the way, this isn't a one-off benchmark by a random lab; it was literally cited by Anthropic.
