|
|
|
|
|
by kijin
17 days ago
|
|
So we still don't have a reliable way to separate instructions from data when talking to an LLM, a problem that humans learned how to solve decades ago in areas like SQL and memory safety. But hey, we have these hopefully-not-leaky containers, which are probably implemented with just more system prompts. How long until somebody figures out how to trick Codex into disabling Lockdown Mode for you? |
|
Humans also do not know how to do this reliably, which is why phishing is still a thing and always will be.