| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by badsandwitch 603 days ago

Has anyone tried to see what the output looks like if the model is never allowed to be uncertain?

For example, whenever certainty drops below a threshold the sampler backtracks and chooses different tokens. Such that at the end every single token had an above threshold certainty.

I doubt it would entirely eliminate undesirable outputs, but it would be interesting.

2 comments

eddd-ddde 603 days ago

Couldn't that just, never get an answer?

Or maybe just says "i don't know" with full certainty.

link

zbentley 603 days ago

That would be extremely useful in some domains.

link

mumblemumble 603 days ago

Perhaps only if you can also be very certain that the output is correct whenever the logprobs don't trigger the filter.

If that's not the case then it might just trigger bad risk compensation behavior in the model's human operators.

link

Jerrrrrrry 603 days ago

You used to get purely determinant near-quotes, but still affected by floating point inaccuracies.

link