	by mananaysiempre 81 days ago
	IIRC people actually measured this, and one of the things RLHF does is turn the fairly well-calibrated probability judgments of the raw predictive model into an essentially binary and much less accurate “definitely” / “no idea, coin toss”, with the former of course being much more frequent. The architecture is perfectly capable of expressing uncertainty; it’s the humans who hate it and sand the capability off until the result fits their preconceptions. (Which is intensely depressing to a human who doesn’t.)
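
	For anyone who wants to see what "well-calibrated" versus "binary" means in numbers, here is a rough toy sketch, not a reproduction of any actual measurement. Everything in it is an assumption made for illustration: the simulated data, the `collapse` rule and its 0.3 threshold, and the choice of expected calibration error (ECE) and Brier score as the metrics. It compares a forecaster that reports the true probabilities with one that flattens them into "definitely" (0.99) or "coin toss" (0.5).

```python
import random


def expected_calibration_error(probs, outcomes, n_bins=10):
    """Weighted mean gap between stated confidence and observed frequency, per bin."""
    bins = [[] for _ in range(n_bins)]
    for p, y in zip(probs, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        bins[idx].append((p, y))
    total = len(probs)
    ece = 0.0
    for b in bins:
        if not b:
            continue
        mean_conf = sum(p for p, _ in b) / len(b)
        freq = sum(y for _, y in b) / len(b)
        ece += len(b) / total * abs(mean_conf - freq)
    return ece


def brier_score(probs, outcomes):
    """Mean squared error between forecast probability and the 0/1 outcome."""
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def collapse(p, threshold=0.3):
    """Hypothetical collapse rule: mostly 'definitely', otherwise 'coin toss'."""
    return 0.99 if p >= threshold else 0.5


rng = random.Random(0)  # fixed seed so the run is repeatable
true_probs = [rng.random() for _ in range(20000)]
outcomes = [1 if rng.random() < p else 0 for p in true_probs]

calibrated = true_probs                       # reports the true probability
collapsed = [collapse(p) for p in true_probs]  # reports only 0.99 or 0.5

ece_calibrated = expected_calibration_error(calibrated, outcomes)
ece_collapsed = expected_calibration_error(collapsed, outcomes)
brier_calibrated = brier_score(calibrated, outcomes)
brier_collapsed = brier_score(collapsed, outcomes)

print(f"calibrated: ECE={ece_calibrated:.3f}  Brier={brier_calibrated:.3f}")
print(f"collapsed:  ECE={ece_collapsed:.3f}  Brier={brier_collapsed:.3f}")
```

	In this toy setup the collapsed forecaster does worse on both metrics, because "0.99" gets attached to events that happen far less often than 99% of the time. How closely that resembles what RLHF does to a real model is exactly the empirical question, and this sketch does not answer it.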