| There's no such thing as a filtered LLM output. How do you plan on avoiding leaks or "side effects" like the tweet here? If you just look for keywords in the output, I'll ask ChatGPT to encode its answers in base64. You can literally always bypass any safeguard. |
You could as well "Inspect Element" to change content on a website, then take a screenshot.
If you are intentionally trying to trick it, it doesn't matter if it is willing to give you a recipe.