| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by aiappreciator 1188 days ago
	Its not 'thanking', its positive signal that GPT's previous outputs were correct, so it should continue doing whatever it was doing. If you say no/bad etc, then GPT will try other approaches.

2 comments

csmpltn 1187 days ago

> "its positive signal that GPT's previous outputs were correct"

Is that somehow baked into the algorithms?

Are positive words of encouragement interpreted as "positive signals" by the inference pipeline? Or do they somehow influence the attention mechanism?

Because otherwise, you're just rationalizing completely random and unpredicted behavior.

link

ChancyChance 1187 days ago

That's literally what I said: those words are "encouraging" it.

link