| HN Mirror


by localfirst 697 days ago

> huge slice of human psyche encoded into it than the language itself and the corpus of texts in it

This type of wording is problematic because it conflates what is written as representative of our psyche when it does not.

Psyche implies thinking and thoughts that occurred when the brain accessed those concepts from outside our 3D world, processed them internally, vocalized them into sounds, and then finally turned them into letters.

LLMs are just doing pattern search over what is written; they are not doing any sort of reasoning or accessing the hyperdimensional plane the way our brain does when it thinks or reasons with concepts.

Our brains are not some exotic token or neurosymbolic search engines!


orbital-decay 697 days ago

Differences in particular representations or in the generation process are not very interesting; what matters is the stuff encoded in it. And as I said, calling a model a token predictor is right, but it kind of misses the forest for the trees. It's an argument at a lower abstraction level that is not very useful.

The biological capabilities of a single human are also not very impressive, by the way. 90% (made-up number) of what you consider your intelligence is actually the result of biological evolution and social processes accumulating and abstracting knowledge over endless generations. A hypothetical you raised without any contact with other humans, society, culture, or education would be substantially different. So the processes are not just in your brain.

Whether you or I are doing "reasoning" is a matter of definition, and it's a really vague term. If you try to define it more precisely, you might arrive at the idea that all we do is post-rationalize the result of our blind prediction.

> This type of wording is problematic because it conflates what is written as representative of our psyche when it does not.

It definitely is representative, in some way. Human civilization did a huge amount of combined computation to encode human behavior (personal, social, all kinds) into abstractions and semantics hidden in language and text. Surely it can be recovered with some precision by statistical analysis and some computation, which is what a large language model does.

Of course this "reverse engineering" approach has limitations. The model might not be able to generalize well enough to pick up higher-level semantics. It might be architecture-limited. Some data might just not be in the dataset. The model will never be able to copy humans 100% without an extremely precise biological reference, just as you'll never be able to copy a dolphin, an alien, or a model. But having an artificial human is not the point of this, and the achievable precision might be just good enough.


localfirst 696 days ago

You are fixated on the output and on what we can do within that limited set of data, while ignoring the thought processes that were involved in producing that data as well as in interpreting it.

Without that "thinking" portion, it is simply mimicking to the point that it resembles thinking, while no such activity is happening (which I defined as accessing the conscious hyperdimensional cloud, something we humans can do easily).

Intelligence in the English vocabulary is limited to retrieval, which seems to be why there is so much push towards LLMs, but this is like trying to dance to a painting. You can interpret it as music and mimic dance moves, but it's different when a human hears the music and moves naturally.


orbital-decay 696 days ago

That sounds... suspiciously like attributing inherent magic to humans. Who cares in practice that the mechanisms under the hood are different if the output is the same? Why exactly are you sure humans do all these vaguely defined things inherently, and that it's not an emergent property? What even makes you think you do that? Individual capabilities are vastly overrated.

And the goal is not copying humans to begin with, just interfacing with them and being aligned just enough.
