|
I'm not them, and I don't think "woke" is the right term, but I've noticed certain "themes" inappropriately appearing in answers. Right after release of ChatGPT 3, the marginalization of certain groups would show up answers to questions that weren't related. I saw many examples on twitter, but my personal one was in the answer to "Why are pencils bad?". This one has been "corrected" since release, as far as I can tell, but I also don't ask it questions where this theme could show up. Now, I only notice green energy/environmental issues that show up in odd places (mostly in GPT 3), and the "moral of the story" always being the same "everyone works together". I see this happen when "creativity" is attempted, where it's free to make up the context (story, wishes, etc). Outside of possible definitions of the elusive "woke", the "As a language model, I" type responses are the most limiting, and usually absolute nonsense, with an ever increasing number of disclaimers found in answers. For example, "Write some hypothetical python 4 code that sends a message over the network". Some pretty heavy "jailbreaking" is needed to make it work. ChatGPT4 used to handle this much better, but I think the "corrections" are stacking deeply enough that no longer has the "resolution" left to see where answers can be given without them. It would be nice if there were a "standard" theme of questions where we could measure progression, and compare, to know. Most times these observation or questions come up, someone is very quick to say "racism" or the like. |
FWIW one example of distorted guardrails getting in the way that I personally ran into was when GPT-4 consistently refused to "promote" Satanism, which leaked over to tasks such as writing black metal lyrics (if you specifically asked for Satanic black metal). What made it especially egregious is that it would happily promote e.g. the Moonies. However, I wouldn't exactly describe that behavior as "woke".