| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by coppsilgold 68 days ago
	Much like a developer can insert a backdoor as a "bug" so can an LLM that was trained to do it. One way you could probably do it is by identifying a commonly used library that can be misused in a way that would allow some kind of time-of-check to time-of-use (TOCTOU) exploit. Then you train the LLM to use the library incorrectly in this way.