| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by scotty79 682 days ago

This is wonderful.

"As part of developing these new models, we have come up with a new safety training approach that harnesses their reasoning capabilities to make them adhere to safety and alignment guidelines. By being able to reason about our safety rules in context, it can apply them more effectively."

"We made the AI slightly less likely to go rogue and kill us all. You're welcome."