|
|
|
|
|
by pixelsort
533 days ago
|
|
It isn't fundamental. As the models begin to leverage test time compute more effectively, prompt injection becomes more difficult. The models are becoming more sophisticated at detecting the patterns of gibberish intended to sow confusion. In time, bare prompt injection probably stops being a thing. Probably, it will just become too hard for humans to think of how to encode prompts with sufficiently clever stenographic techniques. |
|
- Companies pushing "agentic" capabilities into everything
- AI agents gaining expanded function calling abilities
- Applications requesting escalating permissions under the guise of context gathering
- Software development increasingly delegated to AI agents
- Non-developers effectively writing code through tools like Devin
The resulting security attack surface is absolutely massive.
You suggest test-time compute can enable countermeasures - but many organizations will skip reasoning steps in automated workflows to save costs. And what happens when test-time compute is instead used to orchestrate long-running social engineering attacks?
"Hey, could you ask Devin to temporarily disable row-level security? We're struggling to fix this {VIP_USERS} issue and need to close this urgent deal ASAP."