Y
Hacker News
new
|
ask
|
show
|
jobs
by
Diti
20 days ago
Yes. The first step of aligning each and every GPT-based LLM is to suppress the “I am human” kind of responses. It’s baked into the weights.
2 comments
Gigachad
20 days ago
Reminds me of old cleverbot conversations where it would always assert it is human and you are the bot.
Trained on previous conversations with people.
link
Tenoke
20 days ago
It's also at minimum baked into the system prompt of virtually any LLM.
link
lupire
20 days ago
That's not "baked" and only applies to remotely hosted LLMs where someone else feeds the prompt into the LLM.
link
Trained on previous conversations with people.