Hacker News new | ask | show | jobs
by taneq 1191 days ago
Is there a reliable way to 'escape' input? How would you stop the second instance from also being jailbroken by the prompt that tripped up the first instance?
1 comments

Build a different model architecture where the system prompt is a different head than the user prompt and is always equally weighted.

Maybe.