| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by 0xbadcafebee 2 days ago
	OpenAI already did this when it released its "super scary advanced" security model. They silently return an earlier model's results if they think you're redteaming/abusing with it. https://openai.com/index/scaling-trusted-access-for-cyber-de...

1 comments

They din't get as much pushback because they aren't the leader.