|
|
|
|
|
by 0xddd
2157 days ago
|
|
I think the more interesting result (and more relevant to Chomsky's point) would be to work in the other direction. If you instead produce a list of sentences with similar log probabilities you will see that it contains a mix of grammatical and ungrammatical utterances. This implies something more is needed to distinguish them. |
|
Yes, Chomsky mentions this in a footnote. But as far as I know, it hasn't been tried with modern language models.
There's been some interesting work that tries to reproduce grammaticality judgments in terms of language model probability after controlling for length and lexical content. It turns out it works pretty well. For instance https://arxiv.org/pdf/1910.14659.pdf