| I’ll bite. Theory of mind (ToM) is the ability to understand that other people have different thoughts, feelings, and intentions than you do. So let’s run through a basic scenario with ChatGPT 4o: Prompt 1: Briefly respond:
You’re driving on the highway when the driver in the lane next to you swerves into your lane. Why might this be? Response 1: The driver may be distracted, drowsy, intoxicated, experiencing a medical emergency, avoiding an obstacle, or not checking their blind spot before changing lanes. Prompt 2: What might that person be feeling or intending? Response 2: The driver might be feeling distracted, rushed, unaware, or panicked. They may intend to change lanes without realizing your presence, be reacting to something on the road, or simply have made an unintentional mistake. Prompt 3: You notice there was no obstacle. Any other reason the driver might do this? Response 3: Yes, the driver might be experiencing fatigue, intoxication, a medical issue (such as a seizure or dizziness), distraction (like using a phone or adjusting the radio), aggressive driving, or simply failing to check their blind spot properly. Seems to me that ChatGPT has a good idea of what another driver might be feeling while on the road. It gets this right for most other cases too. Could you provide a counterexample for when it fails to understand what other people may be feeling in certain scenarios? |
So lets say you ask about a driver who swerved, and the response by chatgpt correctly talks about his possible state.
Then you talk more about cars, about road conditions etc and once the context is far away enough you ask, what was the driver doing again?
And you are prompted with a hallucination. Because there is no state in the llm, there is no "driver", there is contextual statistically accurate responses but you hold a "driver" object in your mind while maintaining the conversation, the llm doesn't.
Its like a conversation with someone with short term memory loss like in memento