| I wrote about this the other day: - https://simonwillison.net/2023/Oct/14/multi-modal-prompt-inj... If you're new to prompt injection I have a series of posts about it here: - https://simonwillison.net/series/prompt-injection/ To counter a few of the common misunderstandings up front... 1. Prompt injection isn't an attack directly against LLMs themselves. It's an attack against applications that you build on top of them. If you want to build an application that works by providing an "instruction" prompt (like "describe this image") combined with untrusted user input, you need to be thinking about prompt injection. 2. Prompt injection and jailbreaking are similar but not the same thing. Jailbreaking is when you trick a model into doing something that it's "not supposed" to do - generating offensive output for example. Prompt injection is specifically when you combine a trusted and untrusted prompt and the untrusted prompt over-rides the trusted one. 3. Prompt injection isn't just a cosmetic issue - depending on the application you are building it can be a serious security threat. I wrote more about that here: Prompt injection: What’s the worst that can happen? https://simonwillison.net/2023/Apr/14/worst-that-can-happen/ |