Hacker News new | ask | show | jobs
by viccis 496 days ago
A new jailbreaking method with this level of effectiveness against these models that can produce the entirety of those unsafe outputs?

Yes.

May I see it?

No.

2 comments

Seymour! The house is on fire!
You will see it soon. We thought it may be harmful to publish it before it is patched. Especially because you can basically bypass all the safeguards with it.
Sounds like it won’t be verifiable or reproducible.