|
|
|
|
|
by simonw
849 days ago
|
|
This is exactly why I think it's so important that we separate jailbreaking from prompt injection. Jailbreaking is mainly about stopping the model saying something that would look embarrassing in a screenshot. Prompt injection is about making sure your "personal digital assistant" doesn't forward copies of your password reset emails to any stranger who emails it and asks for them. Jailbreaking is mostly a PR problem. Prompt injection is a security problem. Security problems are worth solving! |
|
Maybe just an overlapping set?