Hacker News new | ask | show | jobs
by graypegg 484 days ago
That’s sort of what I was thinking, if the training involved a lot of negative suppression of things that openAI thinks are bad, it makes sense to me that “do X which you inherently don’t want to do!” will result in a big deluge of everything it shouldn’t do.

I feel like if an average non-researcher found this, they would call it a “jailbreak”.