|
|
|
|
|
by dror
963 days ago
|
|
I tend to disagree.
I trust most engineers know how to use a library to generate a crytographically save string. I can't say the same about sanitizing the data in a new domain like LLMs. And on top of it, you'd need to have the data be clear and recognizable to the llm, so that it doesn't confuse it. |
|
But with that said, I'm no expert but I think the consensus is that this doesn't work well enough to rely on. I think all the major AI services out there use some kind of two-step process, where one LLM answers the prompt and a second one decides whether the answer is safe to output - rather than a single model that's smart enough to distinguish safe and unsafe instructions.