They're also trivially fooled by sampling techniques or settings that push the model toward generating rare words more often.
Also foolable with filter-assisted decoding: https://paperswithcode.com/paper/most-language-models-can-be...
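
To make the rare-words point concrete, here's a toy sketch of one such setting, sampling temperature. The vocabulary and logits are made up for illustration and don't come from any real model; the helper names (`softmax_with_temperature`, `rare_mass`) are hypothetical too. Raising the temperature flattens the distribution, so more probability mass lands on the low-probability words.

```python
import math

def softmax_with_temperature(logits, temperature):
    """Turn logits into probabilities after dividing by the temperature."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

# Hypothetical logits: one common word followed by three rare ones.
VOCAB = ["the", "azure", "quixotic", "zephyr"]
LOGITS = [5.0, 1.0, 0.5, 0.0]

def rare_mass(temperature):
    """Total probability assigned to the rare words (everything but 'the')."""
    probs = softmax_with_temperature(LOGITS, temperature)
    return sum(probs[1:])

for t in (0.7, 1.0, 2.0):
    print(f"T={t}: P(rare words) = {rare_mass(t):.3f}")
```

The rare-word mass grows as the temperature goes up, which is the kind of shift in word statistics the comment above is talking about.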