|
|
|
|
|
by TheFlyingFish
97 days ago
|
|
The linked article isn't describing a form of input sanitization, it's a complete separation between trusted and untrusted contexts. The trusted model has no access to untrusted input, and the untrusted model has no access to tools. Simon Willison has a good explainer on CaMeL: https://simonwillison.net/2025/Apr/11/camel/ |
|
I have significant doubt that a P-LLM (as in the camel paper) operating a programming-language-like instruction set with “really good checks” is sufficient to avoid this issue. If it were, the P-LLM could be replaced with a deterministic tool call.