|
|
|
|
|
by jbay808
488 days ago
|
|
I was interested in this question so I trained NanoGPT from scratch to sort lists of random numbers. It didn't take long to succeed with arbitrary reliability, even given only an infinitesimal fraction of the space of random and sorted lists as training data. Since I can evaluate the correctness of a sort arbitrarily, I could be certain that I wasn't projecting my own beliefs onto its response, and reading more into the output than was actually there. That settled this question for me. |
|
Language, as a problem, doesn’t have a discrete solution like the question of whether a list is sorted or not.
Seems weird to compare one to the other, unless I’m misunderstanding something.
What’s more, the entire notion of a sorted list was provided to the LLM by how you organized your training data.
I don’t know the details of your experiment, but did you note whether the lists were sorted ascended or descended?
Did you compare which kind of sorting was most common in the output and in the training set?
Your bias might have snuck in without you knowing.