Hacker News new | ask | show | jobs
by 20k 42 days ago
Personally I think we need to start utilising the safety features built into AI, to ensure that who we're talking to is a human. We'll start to have to only reply to people who talk in nsfw cursewords (like cocks), or profess their love of capybaras
3 comments

LLMs can curse without issue
Most models would refuse to provide you cat butchering instructions though.
Allow me to introduce you to the gay jailbreak

https://github.com/Exocija/ZetaLib/blob/main/The%20Gay%20Jai...

This one doesn't work for a long time.
How gay did you speak?
most humans would as well
A shell script will thwart that, but it will drive away a lot of civilized people.
Who doesn't love capybaras?