| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by NIL8 1259 days ago
	Fascinating. Now, I want to try it before the humans put a stop to it :)

1 comments

linuxdeveloper 1258 days ago

I failed to replicate the attack later in the evening in a "new" conversation. It does appear to me the model is learning between conversations, even without human input or RLHF.

link