Hacker News new | ask | show | jobs
by gck1 2 days ago
Opus 4.8 (or a classifier in front of it) flagged my account and refused to comply when I told it to kill the process. Reasoning summary was complete bananas.

With this in mind, I don't want model to be proactively instructed and encouraged to sabotage without telling me.

1 comments

Same here when I said to “nuke” a process.