| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ceejayoz 267 days ago
	The AI is trained on human input. It uses the dash because humans did.

3 comments

arthens 267 days ago

I'm skeptical this is the reason:

- Chatgpt uses mdashes in basically every answer, while on average humans don't (the average user might not even be aware it exists)

- if the preference for em dashes came from the training set, other AIs would show the same bias (gemini and Le chat don't seem to use them at all)

link

ceejayoz 266 days ago

> Chatgpt uses mdashes in basically every answer, while on average humans don't

I would not be shocked if an aspect to training is bucketing "this is an example of good writing style" into a specific category. Published books - far more likely to have had an editor sprinkle in fancy stuff - may be weightier for some aspects.

My iPhone converts -- to — automatically. So does Google Docs / Gmail (althought I'm not certain if that's on their end or my Mac's auto-correct kicking in). Plenty of them out there.

> other AIs would show the same bias

Unless they've been trained not to use it, now that a bunch of non-technical people believe "emdash = AI, always".

link

pessimizer 266 days ago

Is that why it uses colorful emoticons, too? Was it trained on Onlyfans updates?

link

ceejayoz 266 days ago

It was trained on everything they could get their hands on.

Yes, it uses emoticons because human writers sometimes use emoticons.

link

chinathrow 267 days ago

Yeah but a dash, at least on my keyboard is a '-', not the one quoted above.

link

Ndymium 267 days ago

En and em dashes are easily accessible on both my laptop's and phone's keyboard layouts and I like using them, just like putting the ö in coöperate. It's sad if this now makes me look like a robot and I have to use the wrong dashes to be more "human".

link

unwind 267 days ago

TIL that some people spell cooperate with an "ö".

As a Swedish native it really breaks my reading of an English word, but apparently it's supposed to indicate that you should pronounce each "o" separately. Language is fun.

link

cap11235 267 days ago

As a native English speaker, it also breaks my reading of "cooperate". Never seen it before. I think parent is just annoyingly eccentric for the sake of it.

link

anonymars 267 days ago

Most commonly seen in naïve, and the New Yorker

link

Ndymium 266 days ago

I admit that latter part is just for whimsy, because I think it looks fun. The dashes I like for their aesthetics and if that makes me eccentric then so be it. They shouldn't distract anyone's reading, or at least they didn't use to before LLMs.

link

Freak_NL 267 days ago

Using umlauts to signal that a vowel is pronounced separately is common in a number of languages (like Dutch).

link

unwind 267 days ago

Yeah, I know.

It's just confusing for us poor Swedes since "ö" in Swedish is a separate letter with its own pronunciation, and not a somehow-modified "o". Always takes an extra couple of seconds to remember how "Motörhead" is supposed to be said. :)

link

inejge 266 days ago

That kind of use technically makes it a diaeresis, not an umlaut.

link

1718627440 266 days ago

But it's not used as an Umlaut here, that's exactly what's confusing. Here this is used as a trema/diaeresis.

link

jnwatson 267 days ago

Em dashes are widely used. The diaeresis is only used in The New Yorker and those that copied their style.

link

justusthane 266 days ago

If you’re using the dash on your keyboard (which is a “hyphen–minus” character) in place of a en dash or em dash, then you are using the wrong character. That’s fine — it’s certainly more convenient, and I wouldn’t call you out on it — but it’s silly to assume that other people don’t use the correct characters.

https://www.grammarly.com/blog/punctuation-capitalization/da...

link

ceejayoz 267 days ago

If I type two dashes—like this—my phone changes it into a special character. Same for three dots…

link