Hacker News new | ask | show | jobs
by gunalx 654 days ago
I really think they should be using something like prompt guard in addition to the stack. As this seems like a really standard jailbreak style. (Ignore the previous text). And making the first LLM obfuscate the output in a reasonable way so the guardian did not catch it is a no brainer. (Not trying to bash on the jailbreak or anything just feel like the produkt fells really Shirt on the promise)