HN Mirror



	by simonw 20 days ago
	On the one hand, this is exactly the right solution to prevent lethal trifecta exfiltration attacks. The existence of lockdown mode does, however, imply that ChatGPT, in its default settings, does not provide robust protection against sufficiently determined data exfiltration attacks!

3 comments

berlianta 20 days ago

Related: Simon Willison’s post on OpenAI’s new Lockdown Mode (he coined the “lethal trifecta” term this is based on): https://simonwillison.net/2026/Jun/5/openai-help-lockdown-mo...


jameshart 20 days ago

Related: simonw is Simon Willison


berlianta 20 days ago

Yeah, I know the source references him (I'm replying to his comment); that's exactly why I'm giving credit where it's due.


bombcar 20 days ago

It’s important to draw it out explicitly. I didn’t even look at the commenter’s name until it was mentioned. (If I see pelicans …)


alehlopeh 19 days ago

As explained in a child comment, this comment is a callout to other readers, rather than an actual reply to the parent comment. I know that’s been a thing for a long time, but is there a word for this type of comment?


gchamonlive 20 days ago

I wonder what robust protection would mean in practice for a tool as capable as an agent...

Looking at the three axes of the trifecta, if we assume we can't control untrusted content, that leaves us to create safeguards for private data access and external communication.

Would it be enough to put a buffer between these two events: access to the environment and access to the web?
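
To make the question concrete, here is a minimal sketch of one way to read it, assuming a per-session gate that tracks two legs of the trifecta (untrusted content seen, private data accessed) and refuses the third (external communication) once both are present. `SessionGuard` and its method names are hypothetical, made up for illustration; this is not how ChatGPT or any real agent framework is known to work.

```python
from dataclasses import dataclass


@dataclass
class SessionGuard:
    """Hypothetical per-session policy gate; not any vendor's actual API."""

    saw_untrusted_content: bool = False
    touched_private_data: bool = False

    def record_untrusted_content(self) -> None:
        # Call when the agent reads anything an attacker could have written.
        self.saw_untrusted_content = True

    def record_private_access(self) -> None:
        # Call when the agent reads private data from the environment.
        self.touched_private_data = True

    def may_communicate_externally(self) -> bool:
        # Outbound channels stay open only while at most one of the
        # other two legs of the trifecta is present in this session.
        return not (self.saw_untrusted_content and self.touched_private_data)


guard = SessionGuard()
guard.record_untrusted_content()
web_allowed_before = guard.may_communicate_externally()  # only one leg present
guard.record_private_access()
web_allowed_after = guard.may_communicate_externally()   # both legs present
```

A gate like this only helps if every outbound channel (web requests, rendered images, links) actually goes through it, which is the hard part.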


simonw 20 days ago

Robust protection means blocking any mechanism by which the agent, once compromised, might communicate stolen information back to an attacker.


Noumenon72 20 days ago

I hadn't realized that deep research or generating images that I paste into Twitter were possibly exfiltrating my data. Yikes.
