|
Differences in particular representations or in the generation process are not very interesting; what matters is what is encoded in them. And as I said, calling a model a token predictor is right, but it kind of misses the forest for the trees: it's an argument at a lower abstraction level that is not very useful. The biological capabilities of a single human are also not very impressive, by the way. 90% (made-up number) of what you consider your intelligence is actually the result of biological evolution and social processes accumulating and abstracting knowledge over endless generations. A hypothetical you raised without any contact with other humans, society, culture, or education would be substantially different. So the processes are not just in your brain. Whether you or I are doing "reasoning" is a matter of definition, and it's a really vague term. If you try to define it with more precision, you might come to the idea that all we do is post-rationalize the results of our blind prediction. > This type of wording is problematic because it conflates what is written as representative of our psyche when it does not. It definitely is representative, in some way. Human civilization did a huge amount of combined computation to encode human behavior (personal, social, all kinds) into abstractions/semantics hidden in language and text. Surely it can be recovered with some precision by statistical analysis and some computation, which is what a large language model does. Of course this "reverse engineering" approach has limitations. The model might not be able to generalize well enough to pick up higher-level semantics. It might be architecture-limited. Some data might just not be in the dataset. The model will never be able to copy humans 100% without an extremely precise biological reference, just as you'll never be able to copy a dolphin, an alien, or a model. 
But having an artificial human is not the point of this, and the achievable precision might be just good enough. |
without that "thinking" portion and simply mimicking to the point that it resembles it, while no such activity is happening (which I defined as accessing the conscious hyper-dimensional cloud, something we humans can do easily).
Intelligence, in the English vocabulary, is limited to retrieval, which seems to be why there is so much push towards LLMs, but this is like trying to dance to a painting: you can interpret it as music and mimic dance moves, but it's different when a human hears the music and moves naturally.