| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by ben_w 658 days ago

The outputs aren't really the same, they simply seem plausible at first glance.

For example, I recently experimented with using ChatGPT to translate a Wikipedia article, on the grounds that it mighy maintain all the formatting and that Transformer models are also used by Google Translate.

As it was an experiment, I did actually check the results before submitting the translated article.

First roughly 3/4 were fine. Final quarter was completely invented but plausible, including references.

LLMs are very useful tools, I'll gladly use them to help with various tasks and they can (with low reliability but it has happened) even manage a whole project, but right now they should treated with caution and not left unsupervised — Peter principle, being promoted beyond their competence, still applies even though they're not human employees.