|
|
|
|
|
by danShumway
946 days ago
|
|
I have beef with Lakera AI specifically -- Lakera AI has never produced a public demo that has a 100% defense rate against prompt injection. Lakera has launched a "game" that it uses for harvesting data to train its own models, but that game has never been effective at preventing 100% of attacks and does not span the full gamut of every possible attack. If Lakera AI had a defense for this, the company would be able to prove it. If you had a working 100% effective method for blocking injections, there would be an impossible level in the game. But you don't have one, so the game doesn't have a level like that. Lakera AI is engaging in probabilistic defense, but in the company's marketing it attempts to make it sound like there's something more reliable going on. No one has ever demonstrated a detector that is fully reliable, and no one has a surefire method for defending against all prompt injections, and very genuinely I consider it to be deceptive that Lakera AI regularly leaves that fact out of its marketing. The post above is wrong -- there is no 100% reliable way to catch this particular attack with an injection detector. What you should say is that at Lakera AI you have an injection detector that catches this attack some of the time. But that's not how Lakera phrases its marketing. The company is trying to discretely sell people on the idea of a product that does not exist and has not been demonstrated by researchers to be even possible to build. |
|
Of course that’s a known fact among technical people expert in that matter that an impassable defense against any kind of attack of this nature is impossible.