|
|
|
|
|
by simonw
1128 days ago
|
|
I don't think "works well enough" is good enough. We are currently starting to wire LLMs up as AI-enhanced personal assistants - with the goal of giving them access to our email, and the ability to take actions on our behalf. If we widely deploy these systems the incentives for attackers to figure out prompt injection attacks that get last any probability-based filters we are using will be enormous. An attacker only needs to get lucky with their prompt attacks once. I wrote about the larger threat introduced by these new applications here: https://simonwillison.net/2023/Apr/14/worst-that-can-happen/ |
|
Obviously it is a problem for end users, but that’s more of an optics and trust angle, just as browser extensions are a privacy nightmare.
I am as worried as you are on this front, given that people appear to be using browser extensions liberally, but that doesn’t mean all apps will be equally vulnerable. It is as implementation-dependent as it is dependent on the nature of today’s models.
I appreciate your writing on the topic, it is both a great overview and a reminder to staying engaged while looking for mitigations and best practices.