| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by arthurdd 1403 days ago

Since experimenting with the GPT models I do notice when posts and articles seem to have used generated content, but I can't identify what it is that gives me this feeling. Maybe it is just a feeling.

When using the models, I find the content generated frequently must be edited to be useful. Presumably if it doesn't wander off or repeat itself too much, there is a human in the loop guiding the generation. Take it as seriously as you would any anonymous post on the Internet.

Fact check important stuff by seeking reputable sources. Treat the rest as entertainment. Even if it is possible to detect, just because text is generated does not mean it's incorrect.

I imagine given the way text is generated using probabilities, it can in theory be detected when stretches of text are using high probability words in sequence. I'm not aware of any tools that do this at the moment though.