Hacker News new | ask | show | jobs
by hackpelican 155 days ago
At one point it started spitting out its CoT in the comments of the code it’s supposed to be changing.
1 comments

Ah yeah I've seen that too. Definitely seems related.

I suspect this is also something like the "inverse" of a prompt hijacking situation. Basically it's losing track of where its output is flowing to (whereas prompt injection is when it loses track of where its input is flowing from).