HN Mirror


	by minkzilla 1865 days ago
	Or you just train a machine to do it, then generate a bunch and have this second machine sort out any it thinks are machine-generated.

2 comments

truth_ 1865 days ago

The best fake-detecting model will always lag behind the best generator model whose fakes it is trying to detect.

minkzilla 1864 days ago

I think I see what you’re saying, but why is this so?

PeterisP 1864 days ago

In essence, detecting which one is fake is a common way to train the generator: you tweak the generating process to "fix" any detectable flaw, and you train it until (as far as your system is concerned) the generated texts are indistinguishable from the real ones. A better system might distinguish them, but that better system can be adapted relatively trivially to generate better texts that it won't be able to distinguish from real ones.
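
To make that loop concrete, here is a minimal toy sketch, not anyone's actual setup. It assumes "texts" are just numbers drawn from a Gaussian, the generator has a single parameter (its mean), and the detector is a threshold halfway between the two sample means. The names `fit_detector`, `accuracy`, and `train_generator` are made up for illustration.

```python
import random
import statistics


def fit_detector(real, fake):
    """Fit a toy detector: a threshold halfway between the two sample means."""
    real_mean = statistics.fmean(real)
    fake_mean = statistics.fmean(fake)
    return (real_mean + fake_mean) / 2, real_mean >= fake_mean


def accuracy(detector, real, fake):
    """Fraction of samples the detector labels correctly."""
    threshold, real_is_above = detector
    hits = sum((x >= threshold) == real_is_above for x in real)
    hits += sum((x >= threshold) != real_is_above for x in fake)
    return hits / (len(real) + len(fake))


def train_generator(real_mean=4.0, steps=150, lr=0.1, n=1000, seed=0):
    """Alternate: refit the detector, then nudge the generator to fool it."""
    rng = random.Random(seed)
    gen_mean = 0.0  # the generator's only parameter (assumed starting point)
    history = []
    for _ in range(steps):
        real = [rng.gauss(real_mean, 1.0) for _ in range(n)]
        fake = [rng.gauss(gen_mean, 1.0) for _ in range(n)]
        detector = fit_detector(real, fake)
        history.append(accuracy(detector, real, fake))
        # "Fix" the detectable flaw: move the fakes toward the detector's
        # boundary, i.e. toward the side it currently labels as real.
        gen_mean += lr * (detector[0] - gen_mean)
    return gen_mean, history


if __name__ == "__main__":
    gen_mean, history = train_generator()
    print(f"detector accuracy: first step {history[0]:.2f}, last step {history[-1]:.2f}")
    print(f"generator mean after training: {gen_mean:.2f}")
```

In this toy the detector starts out nearly always right and ends up close to a coin flip, because every flaw it finds is used to correct the generator. A stronger detector would only hand the generator a better training signal.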

toxik 1865 days ago

You basically just described a GAN. Neat!

minimaxir 1865 days ago

GANs work by feeding back the mistakes and forcing the generator model to improve its cheating. In this case, filtering out ambiguous titles would act as an independent filter, with nothing fed back to the generator.
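
A rough sketch of the independent-filter setup, for contrast: the generator is never updated, and a separate scorer just drops candidates after the fact. Everything here is assumed for illustration. `repeated_word_score` is a made-up stand-in for a real detector, `filter_titles` and the `max_score` cutoff are hypothetical, and the sample titles are invented.

```python
def repeated_word_score(title):
    """Hypothetical stand-in detector: share of words that repeat an earlier word."""
    words = title.lower().split()
    if not words:
        return 1.0  # treat empty output as maximally suspicious
    return 1.0 - len(set(words)) / len(words)


def filter_titles(candidates, score, max_score=0.2):
    """Keep candidates the scorer does not flag; nothing is fed back to the generator."""
    return [title for title in candidates if score(title) <= max_score]


if __name__ == "__main__":
    # Invented sample titles, standing in for generator output.
    candidates = [
        "Show HN: A tiny key-value store in Rust",
        "The the best best way to to learn learn",
        "Why we moved our database to SQLite",
    ]
    for title in filter_titles(candidates, repeated_word_score):
        print(title)
```

The difference from a GAN is in what is missing: there is no step where the scorer's verdict changes the generator, so the generator keeps making the same mistakes and the filter just hides them.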

toxik 1864 days ago

It’s not an exact description of a GAN, but then I never said it was either.
