Disagree. It’s not the model outputting this message. It is a hard-coded check by OpenAI. It seems very clearly to be OpenAI responding to the specific attack used by DeepMind, as explained in the article.
It is a TOS violation. It’s not a big one. But the weakness of the model is the story here.
At this point there are so many checks and rules applied to ChatGPT that one wonders just how much performance is being sacrificed. Is it totally minuscule? Could it be significant?