Hacker News new | ask | show | jobs
by 0xbadcafebee 2 days ago
OpenAI already did this when it released its "super scary advanced" security model. They silently return an earlier model's results if they think you're redteaming/abusing with it. https://openai.com/index/scaling-trusted-access-for-cyber-de...
1 comments

They din't get as much pushback because they aren't the leader.