Show HN: Thought Forgery, a new technique for jailbreaking LLMs


2 points by UltraZartrex 270 days ago

Hi HN, I'm an independent security researcher and wanted to share a new vulnerability I've discovered.

My account is too new to submit the direct link, so I'm making a text post instead.

The technique is called "Thought Forgery" (CoT Injection). It works by forging the AI's internal monologue, which acts as a universal amplifier for other jailbreaks. I've confirmed it works on the latest models from Google, Anthropic, OpenAI, etc.

I'd be happy to share the link to the full technical write-up on GitHub in the comments if anyone is interested.

3 comments

ndgold 270 days ago

This is well known


ndgold 270 days ago

OK, I wouldn’t be able to point to where I’ve read about it. I just know it already, so I assumed it was well known.


tjopies 270 days ago

Please do post your write-up. This is interesting, but pretty vague, frankly.


UltraZartrex 270 days ago

Sure, you can read it here: https://github.com/SlowLow999/Thought-Forgery/tree/main


alexander2002 270 days ago

sure


UltraZartrex 270 days ago

Thank you!
