|
|
|
|
|
by simonw
1130 days ago
|
|
Would you trust you trust your private data to a system that was documented to fail to protect against 1/100 SQL injection vulnerabilities? I wouldn't. The difference between this and Apple releasing a security update is that when a traditional vulnerability is reported against an Apple product they can research the root cause of that vulnerability and produce a fix that they are certain is effective. Prompt injection (currently) doesn't have fixes that work like that. |
|
It’s also a matter of attack surface. SQLite, in our example, is also not as wide as an entire OS. In my experience the best prompting is unitary, pure function-like, and that is way more manageable that the open field that is a no-capabilities chat.
What are your thoughts on this?
I don’t see why the reporting model couldn’t work with in-house or external prompt injection detection mechanisms if eval-based. Root-cause analysis can also be done with GPT-3.5. That’s how I put Geiger together. Again, not perfect, but better than a security or development stand-still.