Hacker News new | ask | show | jobs
by dougb5 826 days ago
My guess is that ChatGPT does it because RLHF rewards strongly stated opinions, because that's what humans prefer. It's a kind of "sycophantic behavior" that researchers have observed in these models (https://arxiv.org/abs/2310.13548)