| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by hombre_fatal 606 days ago
	That’s trivially defeated with a recording / transcript.

1 comments

And we could get an AI to review the recording!

It's what OpenAI does. They have a small safety model checking on the big model.

That's OpenAI's current answer to safety. Its far too early to say whether they is actually a good approach to LLM safety.