|
|
|
|
|
by 0xddd
2157 days ago
|
|
It's funny the author wasted energy composing this after admitting he barely knows the origin of the sentence. Chomsky invokes it in "Syntactic Structures" to illustrate that the grammaticality of a given sentence doesn't fully explain the odds of it appearing in a large corpus. "Furiously sleep ideas green colorless" is another low probability sentence, yet a native speaker couldn't perform these sorts of mental gymnastics to twist some meaning out of it. |
|
I'm not so sure. GPT-2 says
log P("Colorless green thoughts sleep furiously.") = -53.64797019958496
log P("Furiously sleep thoughts green colorless.") = -65.46656107902527
The ungrammatical one is lower probability. But those are famous sentences, and probably present in the training data, so let's try
log P("Colorless blue ideas hibernate angrily.") = -60.12953460030258
log P("Angrily hibernate ideas blue colorless.") = -70.02637100033462