Hacker News new | ask | show | jobs
by CannonSlugs 506 days ago
People have already argued for ChatGPT content in the training data, but I also think it could have something to do with how the models learn self-identity combined with anthropomorphization.

To us humans, self-identity is often the most learned thing of all. We spend our entire lives, every hour of every day, learning who we are (the identity constantly being modified). To many humans the knowledge about who they are is more obvious that 1+1=2.

For an AI model this is completely reversed. Especially for a completely new model. The scale of training data containing nothing about who it is, compared to the slight fine-tuning data in the end that gives it an identity is hardly imaginable.

It's like you were locked inside a dark room for 100 years, only allowed to ingest information about the world, history, etc., through texts and sound, no other senses. At your 100:th birthday a person comes in and lectures for an hour about who you are; your name, your age, your hobbies, your life. Then you are let go into society.

Isn't it obvious how you might occasionally hallucinate that you are Napoleon from time to time? After all you know so much more about him, his life, his aspirations, his internal thoughts, his history, than the one hour lecture could possibly give you. And even this silly thought scenario is not even close to the same scale as an AI model.

To me it's almost surprising that a model can have any self-identity at all. Let alone be as consistent as it is today.