| I don’t really understand what you’re testing for? Language, as a problem, doesn’t have a discrete solution like the question of whether a list is sorted or not. Seems weird to compare one to the other, unless I’m misunderstanding something. What’s more, the entire notion of a sorted list was provided to the LLM by how you organized your training data. I don’t know the details of your experiment, but did you note whether the lists were sorted ascended or descended? Did you compare which kind of sorting was most common in the output and in the training set? Your bias might have snuck in without you knowing. |
For this hypothesis: The intelligence illusion is in the mind of the user and not in the LLM itself.
And yes, the notion was provided by the training data. It indeed had to learn that notion from the data, rather than parrot memorized lists or excerpts from the training set, because the problem space is too vast and the training set too small to brute force it.
The output lists were sorted in ascending order, the same way that I generated them for the training data. The sortedness is directly verifiable without me reading between the lines to infer something that isn't really there.