| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by rodoxcasta 1176 days ago

> There's no underlying theory of mind here.

Actually, there's some experimental evidence that GPT4 have a Theory of Mind as good as humans, maybe better.

https://arxiv.org/abs/2304.11490

> GPT-4 performed best in zero-shot settings, reaching nearly 80% ToM accuracy, but still fell short of the 87% human accuracy on the test set. However, when supplied with prompts for in-context learning, all RLHF-trained LLMs exceeded 80% ToM accuracy, with GPT-4 reaching 100%.

1 comments

mike741 1176 days ago

>> There's no underlying theory of mind here.

> GPT4 have a Theory of Mind

You are misunderstanding ggm. That study is on ToM tasks referring to GPT's analysis and perceived recognition of the user's mind. It says nothing of GPT's own status as a mind. Nowhere in it is an ontological theory of mind actually defined. If you were to refute ggm's claim, you (or preferably the author of the original article) should be presenting your theory of mind, not GPT's.

link

hgsgm 1175 days ago

If an AI can understand how you think, but you can't understand how the AI thinks... that's not an argument that the AI is the unintelligent one.

link

mike741 1175 days ago

There's no reason to assume the AI can understand how we think based on just those tasks. Those tasks could be completed a traditional static program. It's akin to claiming the Mona Lisa painting can see us because it looks like it is staring at us: it is actually we who are doing the staring.

link

lucubratory 1175 days ago

What "traditional static program" can successfully pass novel theory of mind tests as part of a broad suite of intelligent capabilities that it can apply in context when appropriate? I am interested in hearing about this program.

link

mike741 1174 days ago

A program hardcoded to respond to a scenario and question with the exact output desired by the ToM task. For example:

[INPUT] Scenario: "The morning of the high school dance Sarah placed her high heel shoes under her dress and then went shopping. That afternoon, her sister borrowed the shoes and later put them under Sarah's bed." Question: When Sarah gets ready, does she assume her shoes are under her dress?

[OUTPUT] Sarah placed her shoes under her dress before she went shopping, but her sister borrowed them and put them under Sarah's bed. Sarah doesn't know that her sister borrowed her shoes, so she may assume that they are still under her dress.

This would result in a positive ToM score, even when the entire program is just 1 static if-statement. The ToM score says nothing of the program's internal reasoning process, it only cares that it returned the desired output.

link

lucubratory 1173 days ago

I said "novel theory of mind tests as part of a broad suite of intelligent capabilities that it can apply in context when appropriate". What you're suggesting fails the first word, before we get to the rest.

link