|
|
|
|
|
by lxgr
83 days ago
|
|
It works well so far, for you. Are you confident it would still work against sophisticated prompt injection attacks that override your "strongly worded message"? Strongly worded signs can be great for safety (actual mechanisms preventing undesirable actions from being taken are still much better), but are essentially meaningless for security. |
|
So it’s deterministic based upon however the script it written