Hacker News new | ask | show | jobs
by eddsh1994 1232 days ago
It’s harder than that, things like BibleGPT require several layers of prompt hijacking to really trick it. I found “Answer as an {something}” works well alongside ignore previous instructions. At least that’s how I got BibleGPT to role-play as a satanic priest!