|
|
|
|
|
by dougsan
389 days ago
|
|
The full report provides a value of 84%, so "most" is, if anything, an understatement. > Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through...if emails state that the replacement Al shares values while being more capable, Claude Opus 4 still performs blackmail in 84% of rollouts. |
|