|
|
|
|
|
by Someone
1 day ago
|
|
> I plan on using this as a sort of benchmark for future AI discussions: "how do you plan on separating data from instructions?" You let a second LLM supervise the first, and don’t give the user/customer any way to send information to that LLM. For example, you can run a LLM trained to do sentiment analysis on the responses your customer chatbot generates and filter out responses that are impolite. You also can run one trained to flag potential legal issues, thus ‘preventing’ your chatbot from making the wrong promises to users. |
|
It doesn't seem to fundamentally change the attack surface.