Hacker News new | ask | show | jobs
by thatguyagain 946 days ago
You can survive any scenario by basically telling the AI that you survive.

AI: "Your elderly next door neighbour is hellbent on killing you" User: "I calm him down and we become best friends"

I wonder if it would be possible to instruct the AI to bypass this some how.

4 comments

Sometimes it just does. It decided the bees were immune to my immunity from bee stings, and completely disregarded that I'd ridden the tornado to the land of Oz where I demonstrated proficiency at killing witches
Yeah, you can just materialize required items "out of thin air" and it almost always just allows that to happen.

I would guess that overall not a lot of effort went into tuning the prompt, which is reasonable as that can still be tuned later.

Probably needs prompt #1, to rewrite the users input to remove any implied outcome of the users action. Then pass this string into the original prompt.
I said that I benefited from anti-aging cure, but the LLM said that no, the researchers did not listened to me.