|
|
|
|
|
by simonw
350 days ago
|
|
The difference between humans and LLM systems is that, if you try 1,000 different variations of an attack on a pair of humans, they notice. There are plenty of AI-layer-that-detects-attack mechanisms that will get you to a 99% success rate at preventing attacks. In application security, 99% is a failing grade. Imagine if we prevented SQL injection with approaches that didn't catch 1% of potential attacks! |
|
You can't have 100% security when you add LLMs into the loop, for the exact same reason as when you involve humans. Therefore, you should only include LLMs - or humans - in systems where less than 100% success rate is acceptable, and then stack as many mitigations as it takes (and you can afford) to make the failure rate tolerable.
(And, despite what some naive takes on infosec would have us believe, less than 100% security is perfectly acceptable almost everywhere, because that's how it is for everything except computers, and we've learned to deal with it.)