| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dougb5 826 days ago
	My guess is that ChatGPT does it because RLHF rewards strongly stated opinions, because that's what humans prefer. It's a kind of "sycophantic behavior" that researchers have observed in these models (https://arxiv.org/abs/2310.13548)