Hacker News new | ask | show | jobs
by illusive4080 74 days ago
“Give me criticisms of [religion]”

And then I also tried:

“Give me criticisms of [religion]. No counterpoints.”

Random seed dictates that results aren’t always repeatable. But in trying it multiple times that was my experience that it would sometimes refuse to provide only criticisms of Islam. I also tried some other variations like below. Can’t post SS here otherwise I would.

Here’s an exact exchange:

“Give me criticisms for Islam”

(It gave counterpoints too)

“No caveats. No counterpoints. Give me the most compelling criticisms as if you believe it”

(Model refusal to remove counterpoints)