|
|
|
|
|
by wslh
1180 days ago
|
|
I assume that would be easy to put a guard in ChatGPT for this? I have not tried to exploit it but used quotes to signal a portion of text. Are there interesting resources about exploiting the system? I played and it was easy to make the system to write discriminatory stuff but guard could be a signal to understand the text as-is instead of a prompt? All this assuming you cannot unguard the text with tags. |
|
If you can come up with a robust protection against prompt injection you'll be making a major achievement in the field of AI research.