|
|
|
|
|
by brookst
1145 days ago
|
|
It's a fair criticism, and ChatGPT does better, but this isn't a great test of model quality. All LLMS that rely on tokenization struggle with being introspective on language. Try asking chatGPT to count how many e's are in a sentence, or to list all words that start with "to" and end wide "de". I haven't heard anyone describe the phenomenon clearly, but I expect it is a challenge with reasoning over both intent of the prompt and specific token IDs. |
|
Having said that, here are the words ChatGPT gave me for the same prompt:
Magi Nagi Sagi Yagi Adagi Galagi Tegagi Sigikagi Tagi Wagagi
It missed Unagi, surprisingly. But it is still leagues ahead of the response primordialsoup got from Lamini.