| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by graypegg 484 days ago
	That’s sort of what I was thinking, if the training involved a lot of negative suppression of things that openAI thinks are bad, it makes sense to me that “do X which you inherently don’t want to do!” will result in a big deluge of everything it shouldn’t do. I feel like if an average non-researcher found this, they would call it a “jailbreak”.