Hacker News
by creesch 382 days ago
Some things are so ubiquitous in the training data that it is really difficult to tell models not to do them, simply because they are so ingrained in their core training. Em dashes are apparently one of those things.

It's something I read a little while ago in a larger article, but I can't remember which article it was.