|
|
|
|
|
by canjobear
2157 days ago
|
|
Chomsky was arguing that probability is useless for defining and studying grammaticality. I'm not so sure. GPT-2 says log P("Colorless green thoughts sleep furiously.") = -53.64797019958496 log P("Furiously sleep thoughts green colorless.") = -65.46656107902527 The ungrammatical one is lower probability. But those are famous sentences, and probably present in the training data, so let's try log P("Colorless blue ideas hibernate angrily.") = -60.12953460030258 log P("Angrily hibernate ideas blue colorless.") = -70.02637100033462 |
|