| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kfajdsl 165 days ago
	All the LLM logprob outputs I've seen aren't very well calibrated, at least for transcription tasks - I'm guessing it's similar for OCR type tasks.

1 comments

energy123 165 days ago

"I already decided in my private reasoning trace to resolve this ambiguity by emitting the string '27' instead of '22' right here, thus '27' has 100% probability"

link