Hacker News new | ask | show | jobs
by tadfisher 108 days ago
My advice: write your own English prose, and try not to let "LLM-speak" leak into your documentation when using them to edit. Ironically, LLMs just plain suck at writing English, like they're incredibly overfit on marketing copy and press releases. I hope someone is working on this, or at least cares about the problem, because that would make this brave new world palatable for reading.
1 comments

They really don't, if you actually bother prompting them. Give them a voice sample, and tell them to match the tone, and you already get something 10x better. Have them revise with a list of common writing problems - not just common LLM patterns, but guidelines for writing better - and you get rid of more.

Properly prompted, an LLM writes far better than most people.

Without weighing in on whether this is true, I'll point out that LLMs could both be better writers than most people and also be bad writers.

Writing is a difficult skill that many (most?) educational systems do not effectively teach. Most people are terrible writers.

Yes, but for most uses that is irrelevant. Most of the complaints are not about them not being top-level writers, but that they stand out negatively from human writing by relying on a bunch of bad tropes and stereotypical language use.

Maybe we shouldn't use it to write novels if we can't push it well beyond average, but you don't need to get it to produce anything more than pretty much average or a little bit better for it to be good enough in competition with average humans.

That is precisely the problem. When writing technical documentation, such as the landing page for an FPGA inference engine, a model should not need to be prompted to use proper voice and to avoid marketing language. There should be enough context in the text of the prompt itself.
I don't think any of this indicates a fundamental property of the tech itself. AI companies post-train their models to sound like what people like to read better. There's a reason that engagement farmers have converged on the tone that these LLMs imitate, namely its something that people prefer. Maybe not you, but it's the same thing that gives us YouTube face on thumbnails etc.

It takes some prompting to nudge the model out of that default voice because post training reinforced it. They will likely shift it once these AI-isms are known and recognized widely. I'd assume the nextgem models under training now will get negative feedback from the human evaluators for talking too AI-like and then there will be new AI smells to calibrate to.

I'm not sure this invalidates anything I'm saying. The tools currently produce terrible-quality output unless actively prompted to stop producing terrible-quality output. To me, that's a bug, and I don't think post-training and popular preference excuses the tool's behavior. There's no value in normalizing slop if it's so easy to fix.
Should Youtube "fix" the proliferation of exaggerated faces in thumbnails?

People prefer the slop, at least until they collectively notice the AI smell, at which point the post training will likely train it out of models and slop will have new characteristics that take a while for the mainstream to detect.

This is like saying people shouldn't need to be trained for a job.

There's no reason to expect a general purpose model to know what you want when you've not given it any training in what to do for your specific case.

And in this case, the models do far better than humans: Most humans can't just switch to copy arbitrary tone, just by giving them a page worth of text. We don't even need to actually train/fine-tune these models further - we just need to actually fully specify the task we give them to get them to write well.