It makes no sense to me that anyone would train a chatbot on ChatGPT conversations and not filter out strings that literally say "openai" and "chatgpt". Extreme incompetence.
Excluding OpenAI/ChatGPT-generated content without also excluding discourse that merely mentions OpenAI/ChatGPT, such as news articles and industry papers, seems like a nontrivial problem to solve at scale.
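
To make the distinction concrete, here is a rough sketch of why a plain substring filter over-matches, next to a narrower heuristic that only flags first-person self-identification. The pattern list and the function names (`naive_filter`, `looks_self_identifying`) are made up for illustration, and the sample strings are invented; this is not how any particular lab filters its data.

```python
import re

# Naive approach: drop anything that mentions the vendor or product at all.
NAIVE = re.compile(r"openai|chatgpt", re.IGNORECASE)

# Hypothetical, hand-picked patterns: first-person phrasing that assistant
# output tends to use about itself. A real pipeline would need far more.
SELF_ID = [
    re.compile(p, re.IGNORECASE)
    for p in (
        r"\bas an ai (?:language )?model\b",
        r"\bi(?:'m| am) chatgpt\b",
        r"\bi(?:'m| am) (?:a |an )?(?:large |ai )*language model\b",
        r"\bi was (?:trained|developed|created) by openai\b",
    )
]


def naive_filter(text: str) -> bool:
    """True if the text mentions OpenAI or ChatGPT anywhere."""
    return NAIVE.search(text) is not None


def looks_self_identifying(text: str) -> bool:
    """True if the text contains a first-person self-identification phrase."""
    return any(p.search(text) for p in SELF_ID)


# Invented sample strings, for illustration only.
generated = "As an AI language model developed by OpenAI, I cannot browse the internet."
news = "A news article reported that OpenAI updated ChatGPT this week."
unmarked = "Sure! Here is a simple recipe for pancakes."

for sample in (generated, news, unmarked):
    print(naive_filter(sample), looks_self_identifying(sample), sample)
```

The naive filter drops the news sentence along with the generated one, while the narrower heuristic keeps the news sentence but, like the naive filter, lets through generated text that carries no telltale phrase. That last case is the part that stays hard at scale.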