Hacker News new | ask | show | jobs
by TomaszZielinski 933 days ago
Since this is HN, let me share me an idea I had one day.

Psychology/CBT says that we have so called "core beliefs"--i.e. convictions we hold, that we're not really aware of as they are embedded deep in our minds.

For instance, if one of our core beliefs is "the world is a bad place and people are hostile", we can by default perceive other people as potential threats and feel anxious/afraid/endangered.

Now, ChatGPT has the system prompt "I'm ChatGTP, ...." that's hidden from us.

IMVHO that system prompt is like a core belief. Or vice versa--our core beliefs are like ChatGPT's system prompts.

And CBT can be thought as sandbox escape/prompt injection. If we find a way to override the system prompt, we can effectively change it!

Of course it's just a shallow analogy, but maybe someone will find it useful!