As shown by the GPT-5 reaction, a majority of people just have nothing better to ask the models than how many times does the letter "s" appear in "stupid".
I think this is a completely valid thing to do when you have Sam Altman going on the daily shows and describing it as a genius in your pocket and how it's smarter than any human alive. Deflating hype bubbles is an important service.
But the point is, why would you trust it for anything at all, when it can't do an incredibly simple thing reliably at all? (Yes, I understand the tokenizer makes this hard, but still, it's a quick demonstration that it's just bad technology.)