Hacker News new | ask | show | jobs
by not_a9 55 days ago
FYI this does not work for CTF challenges at least - I’ve seen a lot of rev/pwn challenges try to add magic refusal strings/prompt hijacking and models really don’t give a damn.