| HN Mirror


	by stonecauldron 80 days ago
	Exactly. I think that, by their very design, LLMs are very sensitive to how a question is framed. But I wonder how much of that comes from RLHF itself and how much just from the way token prediction works.

1 comment

It's likely the RLHF process, since models differ significantly in how sensitive they are to framing.