| HN Mirror


	by ocbyc 821 days ago
	Transformers are just pattern matching. So if you write "give me a list of dog names" it knows that "Spot" should be in that result set. Even though it doesn't really know what a dog is, a list is, or what a spot is.

5 comments

rafaelero 821 days ago

> Transformers are just pattern matching.

That's trivially true. The question is: are we any different?

richardw 821 days ago

I think so. You ask that question because you’re interrogating the position, not because 1000 humans have asked that question in similar situations.

You and I know there’s a truth and we’d like to find it. The GPT is just happy (i.e. rewarded) to produce frequently used tokens.

mewpmewp2 821 days ago

And I'm just happy to perform actions that will make me survive and reproduce?

richardw 821 days ago

Most likely, unless you meditate a lot. Sometimes you'll take a bullet to save other people. Sometimes you'll drink yourself into a state that doesn't help you survive or reproduce. Or you'll write on a forum anonymously that doesn't help with survival or reproduction because it's enjoyable, makes you think, or you're addicted. Who knows :)

mewpmewp2 821 days ago

You are even better at analyzing me than GPT-4.

parpfish 821 days ago

but maybe that feeling of 'looking for truth' is just what happens when you're doing pattern matching on the text embeddings?

richardw 821 days ago

I feel it’s a bit more, given we think about it after the sentence is complete. But it raises some interesting questions about what an agent would do if it had an instruction to keep trying until it got a reliable answer. Maybe an argument generating agent and a critic agent.

Worth a shot :)

parpfish 821 days ago

I approach LLMs with the perspective that “maybe this demonstrates that we humans are all just stochastic parrots?” and that our null hypothesis should be that humans are just pattern matchers.

mewpmewp2 821 days ago

This is the way I perceive my thoughts. I don't know in advance what I'm going to think; these could all be stochastic "tokens" based on what I've observed in my life.

So of course I feel a bit offended when people claim LLMs are just stochastic parrots, because it doesn't feel to me that I'm specifically any better.

My thoughts just happen, and sometimes not in my favor. I have had times of depression when I didn't have control over my thoughts. Nor do I have it now, but at least I am in a better place, because the "happiness" chemicals are regulated into a state more favorable to me, for various reasons.

I didn't know what I was going to write in response to your comment; I was just streaming my consciousness.

bongodongobob 821 days ago

I don't think that's true. They clearly group related things together and seem to be able to create concepts that aren't specifically in the training data. For example, it will figure out the different features of a face, eyes, nose, mouth even if you don't explicitly tell it what those are. Which is why they are so cool.

zeusk 821 days ago

Most of that magic comes from embeddings, no? They cluster things by their relations in some N-dimensional space.
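
To make the clustering idea concrete, here is a minimal sketch. The four vectors are hypothetical and hand-picked for illustration (real embeddings are learned and have hundreds or thousands of dimensions), and `nearest` is a made-up helper, not any library's API. It only shows that "related" can mean "points in a similar direction":

```python
import math

# Hypothetical 3-d vectors chosen by hand; NOT taken from any real model.
EMBEDDINGS = {
    "Spot": [0.9, 0.1, 0.0],
    "Rex": [0.8, 0.2, 0.1],
    "Gary": [0.1, 0.9, 0.1],
    "Florence": [0.2, 0.8, 0.2],
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(word):
    """Return the other word whose vector is most similar to `word`'s."""
    others = (w for w in EMBEDDINGS if w != word)
    return max(
        others,
        key=lambda w: cosine_similarity(EMBEDDINGS[word], EMBEDDINGS[w]),
    )


print(nearest("Spot"))
print(nearest("Gary"))
```

With these toy numbers the dog-like names sit close to each other and the human-like names sit close to each other, which is all "clustering by relation" means here.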

bongodongobob 821 days ago

Exactly. It figures that out on its own. That's what "understanding" looks like in this context, imo.

mewpmewp2 821 days ago

They are cool, but then you are also cool.

ALittleLight 821 days ago

Can you describe a test that would separate trivial pattern matching from true understanding?

lottin 821 days ago

A simple conversation would do.

mewpmewp2 821 days ago

Could you share a link to a conversation with GPT-4 about either a "list" or a "dog", to determine whether it truly understands one of those things compared to a human?

lottin 820 days ago

I don't have a GPT account. I would start with: "Do you like dogs?" Next question: "Why?"

mewpmewp2 820 days ago

It kind of answered "why" for me

"""I think dogs are wonderful! They're known for their loyalty, playfulness, and their ability to bring joy to people's lives. What about you? Do you have a favorite breed or dog story?"""

What do I ask next?

lottin 819 days ago

This reply sounds so fake that, in my opinion, it should be enough to rule out any hint of intelligence. However, if you insist, I'd continue with this:

"I'm not a fan of dogs. I do know a few dogs though. Sometimes I invite my neighbour's dog for dinner. He's got good taste, for a dog. The last time he came around we talked about the situation in the Middle East. Do you know a good book about this topic that I could recommend to him?"

bongodongobob 821 days ago

Just did that. It seems to understand. Checkmate /fingerguns

mewpmewp2 821 days ago

How would I test whether I "know" or "understand" what a dog is?

notahacker 821 days ago

Oh, that's easy, we just give the dog a keyboard and see if you accurately identify it's a dog from your text based interactions ;-)

mewpmewp2 821 days ago

Are you calling me a dog?

inopinatus 821 days ago

Even this seems too grand a claim. I’d water it down thus: the LLM encodes that the token(s) for “Spot” are probabilistically plausible in the ensuing output.
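
As a rough sketch of what "probabilistically plausible" means mechanically: a model emits a score (logit) per candidate token, and a softmax turns those scores into a probability distribution. The logit values below are hypothetical numbers made up for illustration, not output from any real model:

```python
import math

# Hypothetical logits for the next token after "give me a list of dog names";
# the values are invented for this example.
LOGITS = {"Spot": 4.0, "Rex": 3.5, "Gary": 0.5, "Florence": 0.0}


def softmax(logits):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(logits.values())  # subtract the max for numerical stability
    exps = {token: math.exp(score - peak) for token, score in logits.items()}
    total = sum(exps.values())
    return {token: value / total for token, value in exps.items()}


probs = softmax(LOGITS)
for token, p in sorted(probs.items(), key=lambda kv: -kv[1]):
    print(f"{token}: {p:.3f}")
```

Under these made-up scores "Gary" is not impossible, just much less likely than "Spot", which is the watered-down claim: plausibility, not a yes/no notion of what counts as a dog name.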

bongodongobob 821 days ago

...because it understands what a dog name is. Why wouldn't you see Gary or Florence in that list? How does it know those aren't dog names?

You can't be suggesting it has memorized relationships between all concepts, the model would be enormous.

So clearly, there is something else going on. It's able to encode concepts/ideas.

inopinatus 821 days ago

The model is enormous, and N-dimensional for very high N. But the model remains insufficiently enormous for understanding, and moreover, the model cannot observe itself and adjust.

Ask an LLM to extrapolate, see any semblance of reason collapse.

mewpmewp2 821 days ago

Extrapolate what?
