I'm not talking about the edge cases where it goes off the rails, I'm talking about the way it normally conversates. The way it was trained to do so through RL.
Oh, are you saying that incentivizing engagement (usually to increase ad views and revenue) also implicitly increases bad behaviors and outcomes, and that is why it won't be fixed? That sounds plausible. Even before LLMs the engagement/attention economy had strong negative effects that companies didn't want to address.