Have you had many conversations with it? For me it took an hour before I found it saying anything particularly wrong and even then it was more subtle than the above.
It can’t do haikus. It very confidently puts them together with wrong syllable counts over and over, even after you correct it many times. Then you ask it why it is so bad at counting syllables and it gives a great answer about how it is trained on text and doesn’t hear the words, so it is hard for it to count syllables. But it doesn’t explain this when it is putting the haikus together or when you correct it over and over. It is humble when you directly challenge it, but it needs to be more transparent when it is feeding you garbage.
In my experience it takes a lot of leading to get anything interesting - it is very dependent on my prompts. I've 'learned' how to get better output from it, because let's face it, it is boring to try and speak with it naturally and experience the junk it responds with. And the 'very correct' class of which I spoke really does seem to be the exception, not the rule.
It often doesn't seem wrong, but it's also not right. It's very vague in a lot of places, and when you get down to specifics it starts getting really wrong or flip-flopping a lot. I had issues with this almost off the bat. It's like Dunning-Kruger as a service, really.