Hacker News new | ask | show | jobs
by manmal 497 days ago
Have you considered that the nature of numeric characters is just so predictable that they can be sorted without actually understanding their numerical value?
1 comments

Can you say more precisely what you mean?
I mean that maybe gradient descent is a passable sorting algorithm, once the weights have been learned to properly describe ordering. It may be a speciality of transformers that they can sort things well. Which wouldn’t tell us that much about whether they are mentalists or not.