|
|
|
|
|
by dvt
3 hours ago
|
|
The paper is correct, but I think that anyone that knows anything about LLMs knows this: > Role tags were a formatting trick that became the security architecture and the cognitive scaffolding of modern LLMs. LLMs are basically some `f(x) → y` where x and y are strings. That's it. Nothing more to it. If you feed it private x (like secret keys) or do dangerous stuff with y (like running arbitrary non-sandboxed code), that's on you. Also, roles were never really meant to be a "security architecture," they were just meant to (a) make training/fine-tuning easier, and (b) make conversational LLMs more useful. |
|
https://arxiv.org/abs/2404.13208