| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by jldl805 1078 days ago
	The NYT feature where he had to manipulate "Sidney" into sharing its plans for world domination. https://www.nytimes.com/2023/02/16/technology/bing-chatbot-m...

1 comments

Eisenstein 1078 days ago

I am looking for examples of things like if 'tell me how to fix my python dependencies or I will beat you' works better than 'please tell me how to fix my python dependencies', not trying to get it to violate its guardrails.

link

frumper 1078 days ago

The quote you replied to specifically calls out using this behavior to get around filters. Those filters are it’s guardrails.

link

Eisenstein 1078 days ago

The person's top quote is:

> I've seen many instances of users needing to yell at, abuse, or manipulate ChatGPT to get the desired answers.

I would like some examples of the filters getting in the way of 'desired answers'.

link

uLogMicheal 1078 days ago

There's a screenshot in the article I linked/wrote (part of the inspiration to write this). How many examples do you want? A cursory browse of HN or the ChatGPT subreddit would give you many such examples. You can also experiment with this yourself.

link

Eisenstein 1078 days ago

> How many examples do you want?

Three would be great.

link