HN Mirror



	by sega_sai 451 days ago
	Can we have models also return a probability reflecting how accurate their statements are?

2 comments

cyanydeez 451 days ago

Sure, but then you need probability stats on the probability stats.


sega_sai 451 days ago

I am not sure what you mean. The idea is that the network should return the text plus a confidence expressed as a probability. During training, the log-score of that confidence would be optimized. (I'm not sure it would actually work given how the training is structured, but something like this would be useful.)
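
For readers unfamiliar with the log-score mentioned here, a minimal sketch follows. It assumes each statement gets one stated confidence and a later correct/incorrect label; the function names and the sample numbers are made up for illustration and do not come from any real model or training setup.

```python
import math

def log_score(confidence: float, was_correct: bool) -> float:
    """Logarithmic score for one statement; 0 is perfect, more negative is worse."""
    eps = 1e-12  # clamp so that log(0) never occurs
    p = min(max(confidence, eps), 1.0 - eps)
    return math.log(p) if was_correct else math.log(1.0 - p)

def mean_log_score(predictions):
    """Average log-score over (confidence, was_correct) pairs."""
    scores = [log_score(c, ok) for c, ok in predictions]
    return sum(scores) / len(scores)

# Hypothetical data: both "models" get 3 of 4 statements right.
hedged = [(0.9, True), (0.9, True), (0.6, False), (0.6, True)]
overconfident = [(0.99, True), (0.99, True), (0.99, False), (0.99, True)]

print(mean_log_score(hedged))
print(mean_log_score(overconfident))
```

With the same accuracy, the overconfident set scores worse because its single confident miss is penalized heavily. That penalty is what makes the log-score a candidate training objective for stated confidence.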


redman25 451 days ago

It's not that simple: how would the model know when it knows? Removing hallucination has to be a post-training thing, because you first need to test the model against what it actually knows in order to provide training examples of what it knows and doesn't know, and of how to respond in each case.
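
One way to picture the "test the model first, then build examples" step is sketched below. Everything here is an assumption for illustration: `sample_answers` is a hypothetical callable standing in for repeated sampling from a model, `fake_model` is a stub, and the exact-match check and 0.75 threshold are arbitrary choices, not an established recipe.

```python
from typing import Callable, List, Tuple

IDK = "I don't know."

def build_examples(
    qa_pairs: List[Tuple[str, str]],
    sample_answers: Callable[[str, int], List[str]],
    n_samples: int = 8,
    threshold: float = 0.75,
) -> List[Tuple[str, str]]:
    """Label each question as known or unknown by how often samples match the truth."""
    examples = []
    for question, truth in qa_pairs:
        answers = sample_answers(question, n_samples)
        hits = sum(a.strip().lower() == truth.strip().lower() for a in answers)
        # Known: train on the true answer. Unknown: train on an explicit refusal.
        target = truth if hits / len(answers) >= threshold else IDK
        examples.append((question, target))
    return examples

def fake_model(question: str, n: int) -> List[str]:
    """Stub model: consistent on one question, guessing on everything else."""
    canned = {"Capital of France?": ["Paris"] * n}
    return canned.get(question, [f"guess {i}" for i in range(n)])

pairs = [("Capital of France?", "Paris"), ("Obscure question?", "42")]
print(build_examples(pairs, fake_model))
```

The resulting pairs could then serve as post-training data: answers where the model is reliably right, and an explicit "I don't know" where it is not.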


jsnider3 451 days ago

You can ask a model to give you probability estimates of its confidence, but to my knowledge none of the frontier models were trained to be good at giving them.
