| HN Mirror



	by codeflo 1864 days ago
	5/5 on hard mode, but it’s tough sometimes, since I don’t actually know much about biology. But if you’ve played around with GPT before, you get better at spotting the subtle logical errors it tends to make. I wonder whether the ability to identify machine-generated text will become a useful skill at some point.


minkzilla 1864 days ago

Or you just train a machine to do it and then generate a bunch and have this second machine sort out any it thinks are machine generated.
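
As a rough illustration of that pipeline (generate a batch, then let a second model throw out anything it flags), here is a minimal sketch. The detector `repeats_a_word` is a hypothetical stand-in heuristic rather than a trained model, and the candidate titles are made up for the example.

```python
from typing import Callable, Iterable, List


def filter_generated(
    candidates: Iterable[str],
    looks_generated: Callable[[str], bool],
) -> List[str]:
    """Keep only the candidates the detector does NOT flag as machine generated."""
    return [text for text in candidates if not looks_generated(text)]


def repeats_a_word(text: str) -> bool:
    """Hypothetical stand-in detector: flags text where a word is immediately repeated.

    A real detector would be a trained classifier; this heuristic only
    exists to make the sketch runnable.
    """
    words = text.lower().split()
    return any(a == b for a, b in zip(words, words[1:]))


# Made-up candidate titles, standing in for a generator's output.
candidates = [
    "Gene expression in in yeast under heat stress",
    "Protein folding dynamics in cold-adapted bacteria",
    "A survey of of soil microbiomes",
]

survivors = filter_generated(candidates, repeats_a_word)
```

Note that the generator is never updated here: the detector only acts as a filter on its output.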


truth_ 1864 days ago

The best fake-detecting model will always lag behind the best generator model whose fakes it is trying to detect.


minkzilla 1864 days ago

I think I see what you’re saying, but why is this so?


PeterisP 1864 days ago

In essence, detecting which one is fake is a common way to train the generator: you tweak the generating process to "fix" any detectable flaw, and you train it until (as far as your system is concerned) the generated texts are indistinguishable from the real ones. A better system might distinguish them, but that better system can be adapted relatively trivially to generate better texts, which it in turn won't be able to distinguish from real ones.
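
To make the loop above concrete, here is a toy sketch with numbers instead of text. Everything in it is an assumption made for illustration: "real" data are draws from a Gaussian centred at 4.0, the generator has a single parameter (`gen_mean`), and the detector is a one-threshold classifier. It is not a real GAN (there are no neural networks or gradients through the detector), only the same feedback idea in miniature.

```python
import random
from statistics import mean


def sample(rng, centre, n):
    """Draw n values from a unit-variance Gaussian around `centre`."""
    return [rng.gauss(centre, 1.0) for _ in range(n)]


def fit_detector(real, fake):
    """Fit a one-threshold detector; returns a function x -> True if x looks fake."""
    m_real, m_fake = mean(real), mean(fake)
    threshold = (m_real + m_fake) / 2
    fake_is_lower = m_fake < m_real
    return lambda x: (x < threshold) if fake_is_lower else (x >= threshold)


def detector_accuracy(detector, real, fake):
    """Fraction of samples the detector labels correctly."""
    correct = sum(not detector(x) for x in real) + sum(detector(x) for x in fake)
    return correct / (len(real) + len(fake))


def train(real_mean=4.0, steps=200, lr=0.1, seed=0):
    rng = random.Random(seed)
    gen_mean = 0.0  # the generator's only tunable parameter (toy assumption)
    history = []
    for _ in range(steps):
        real = sample(rng, real_mean, 64)
        fake = sample(rng, gen_mean, 64)
        detector = fit_detector(real, fake)
        # Score the detector on fresh samples it was not fitted on.
        history.append(
            detector_accuracy(
                detector,
                sample(rng, real_mean, 1000),
                sample(rng, gen_mean, 1000),
            )
        )
        # "Fix" the detectable flaw: shift the generator toward the real data.
        gen_mean += lr * (mean(real) - mean(fake))
    return gen_mean, history


gen_mean, history = train()
print(f"detector accuracy: first round {history[0]:.2f}, last round {history[-1]:.2f}")
```

In this toy setup the detector starts out nearly perfect and ends up close to coin-flipping, which is the "indistinguishable as far as your system is concerned" end state described above.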


toxik 1864 days ago

You basically just described a GAN. Neat!


minimaxir 1864 days ago

GANs work by feeding back the mistakes and forcing the generator model to improve its cheating. In this case, filtering out titles that are ambiguous would act as an independent filter.


toxik 1863 days ago

It’s not an exact description of a GAN, but then I never said it was either.
