Hacker News new | ask | show | jobs
by SeriousGamesKit 1129 days ago
Thanks SimonW! I've really enjoyed your series on this problem on HN and on your blog. I've seen suggestions elsewhere about tokenising fixed prompt instructions differently to user input to distinguish them internally, and wanted to ask for your take on this concept- do you think this is likely to improve the state of play regarding prompt injection, applied either to a one-LLM or two-LLM setup?
1 comments

I'll believe that works when someone demonstrates it working - it sound good in theory but my hunch is that it's hard or maybe impossible to actually implement.