| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Yokohiii 35 days ago
	There are prompt guard classifiers that can detect prompt injections, but they are not perfect (false positives, obfuscation) and should be only a part of the defense. The concern is real and unsolved. I think security researchers have an advantage here because they still can fall back to manual audits if their automated analysis (or scores thereof) is off.