HN Mirror



	by mmoskal 799 days ago
	You can ask the model something like: "Is xyz correct? Answer with one word, either Yes or No." The log probs of those two tokens should indicate how certain it is. However, RLHF-tuned models are apparently worse at this than base models.
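
	A minimal sketch of turning those two log probs into a single confidence number, assuming you already have the log probs for the first generated token as a plain dict. The function name `yes_no_confidence` and the example numbers are made up for illustration. How you actually get the log probs depends on your API or inference library.

```python
import math

def yes_no_confidence(logprobs):
    """Return P(Yes) renormalized over only the "Yes" and "No" tokens.

    `logprobs` maps token strings to natural-log probabilities.
    A missing token is treated as having probability 0.
    """
    p_yes = math.exp(logprobs.get("Yes", float("-inf")))
    p_no = math.exp(logprobs.get("No", float("-inf")))
    total = p_yes + p_no
    if total == 0.0:
        raise ValueError("neither 'Yes' nor 'No' has any probability mass")
    return p_yes / total

# Hypothetical log probs for the first answer token (made-up numbers).
example = {"Yes": math.log(0.6), "No": math.log(0.2), "Maybe": math.log(0.2)}
confidence = yes_no_confidence(example)
print(f"P(Yes | Yes or No) = {confidence:.2f}")
```

	Renormalizing over just the two tokens ignores whatever mass the model puts on other answers. If that leftover mass is large, the model probably isn't following the one-word instruction and the number shouldn't be trusted much.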

1 comment

nurple 799 days ago

Seems like function calling could work well to give it an active, distinct choice, but I'm still unsure whether the function and parameters it picks would be the logical, correct answer...
