Hacker News new | ask | show | jobs
by gruez 384 days ago
You realize that the reason why em dashes are so prevalent in chatgpt output is that they're present in the training data, ie. newspaper/magazine articles? I get being suspicious of em dashes in a reddit comment or whatever, but I'd expect em dashes from a professionally typeset publication.
2 comments

Professionally typeset publications, or word documents, or regular Internet comments written by iOS users—it converts double-dashes to em-dashes automatically.

I’m convinced most folks noticing this now just weren’t aware of the punctuation before they heard about it in the AI paranoia context.

I’m also convinced a good chunk of Reddit comments are AI spam. But, I mean, we have to imagine anyone running an AI campaign knows to avoid the em-dashes by now.

Even in a Reddit comment they’re not that strange. macOS and iOS automatically turn two dashes in to an em dash when typing, so a lot of posters are probably using them without even realizing it and another portion stumbled across that and thought “oh neat” and kept doing it. Same goes for smart quotes.

So it’s just as likely that by spotting those features in a post you’ve found an Apple user as it is that the poster is an LLM.