Hacker News new | ask | show | jobs
by cyanydeez 68 days ago
One could consider that the LLM paradox: If you don't want an LLM talking about how to make a nuclear weapon, you first need to explain to them how to make a nuclear weapon, which increases the likelyhood, despite your admonition, that they would talk about it.

So perhaps you can point your LLM at this and ask it to inverse the rules and make sure user design remains consistent.