Hacker News new | ask | show | jobs
by p1esk 2154 days ago
It's a lot easier to notice logical mistakes in already written text, than it is to avoid making them in the first place

For a human who does logical thinking, yes. But for a language model? I'm actually not sure, because it's possible that a sufficiently complex language model like GPT-3 does form some kind of general logical rules encoded in its weights somehow. This would be interesting to explore.

I actually have implemented it and it works quite reasonably.

Oh, so you are trying to design GPT-2 like a GAN, or at least move into that direction. Interesting. Yes, I don't see why not. What do you think about taking a step further, and actually making it a GAN, i.e propagating the error from discriminator into the encoder? I'm sure you're aware of multiple attempts to do this with smaller models, with mediocre results, but maybe GPT-3 scale is what needed to make it work?