Hacker News new | ask | show | jobs
by tekacs 5 hours ago
What you're describing only applies to security or biotech downgrades. A downgrade related to the model believing that you're doing something related to model development is invisible and silent and internal.
1 comments

Anthropic has reversed that decision. (But that just happened so it might have been true during the article's testing.)
When I reported this, Anthropic sent me an email on Tuesday saying, "You have been approved into the Cyber Verification Program", but it's still downgrading. Is this a bug? What's the point of the Cyber Verification Program if Fable 5 downgrades when you tell it to write secure code?
I don’t think that’s relevant? The change is that it will no longer silently downgrade, and will instead be honest that it’s doing it in all cases.
Not sure if it's wise to trust them again even if they say they reversed it.
I was just coming here to post this reply to myself! You're absolutely right! :)

Honestly so glad to see the reversal.