|
|
|
|
|
by krethh
122 days ago
|
|
> define communication protocols between them that fail when prompt injections are present There's the "draw the rest of the owl" of this problem. Until we figure out a robust theoretical framework for identifying prompt injections (not anywhere close to that, to my knowledge - as OP pointed out, all models are getting jailbroken all the time), human-in-the-loop will remain the only defense. |
|