|
|
|
|
|
by solid_fuel
8 days ago
|
|
Not really an accurate comparison since buffer overflows and sql injection are bugs which ultimately allow user data to co-mingle with executable code. LLMs take user data and mix it with the "executable code" (if we are extremely generous in our description of a user prompt) by design. The issue here is unavoidable because LLMs are broken by design. There is no encapsulation where you can separate instructions and data because LLMs are nothing more than next-token predictors and the input sequence MUST be a sequence. They can't build a model with one stream for instructions and another for data because the training data they stole from the internet and books is a single stream. |
|
That “stolen” training data, most of which itself was stolen from older works, does not include user prompts. It is data, not control.
We will see models with annotations for whether a token is part of user prompt, and other ways as well.
You’re obviously passionate about the subject but as someone who works in the field, I assure you there is no now-and-forever requirement for a single stream with no metadata about tokens. We will positively see control and data separated just like they were for phones and databases.