Hacker News new | ask | show | jobs
by ikidd 78 days ago
Funny enough I was processing some handwritten tables into excel with Sonnet. It did way better than I thought it would, I'd say like 95%.

I did have it put confidence indexes next to the output per line, and that was pretty useless, they were either really high or really low, and the confidence didn't match the mistakes at all.

1 comments

IMHO LLMs cannot provide statistically confident measures, and they are terrible at pretending to be capable of doing so.

What worked: You use an OCR that provides character/word-level bounding boxes and let the LLM extract from data. Then the LLM is capable of "calculating" a confidence of extracted data.