| It's easy to focus in on particular linguistic tics, which will probably get smoothed away in future training. The underlying issue is that the LLM is trying to ape meaningful writing - which takes the reader from A to a surprising Z - without generally basing it on a meaningful insight. Most of the common tells stem from that desire to signal the gap between what it's writing about now and how you previously thought things worked. And they really fall flat when it's writing about some milquetoast truism with all the edges sanded off. Rather than have "Z" be self-evidently interesting, the LLM need to tell us that it's not "A". Except no one thought anything was "A" in the first place, and the "Z" is barely a "B" let alone a "Z". Or things are "quietly X," implying that there is some secret knowledge that other people have. Barring that, the LLM will signpost arguments, telling you how interesting things are. "This is the critical part..." before launching into another banal non-observation. Why have only "Section Header" when you can have "Section Header (another idea I had) - omg I just have so much to say about this". Maybe a list of weak ideas, presented in multiple ways. Bulleted with emojis. As a rule of three. As choppy sentences in a paragraph. All of these follow the same pattern - trying to follow the FORM of having something interesting to say that differs from consensus understanding, without having the intelligence or boldness to actually midwife a new concept into the world. |
I think another tell with a long lifespan is that it will try to write about a thing it experienced, but having no real knowledge of the qualia, it'll end up only referring to it (not uncommonly using "that") and leaving all the substance as an exercise for the reader