Hacker News new | ask | show | jobs
by thelittleone 479 days ago
How about building a tool which indexes ocr chunks / tokens and a confidence grading. Setting a tolerance level and defining actions where the token or chunk (s) fall below that level. Actions could include could include automated verification using another model or last resort human.
1 comments

How would you calculate the confidence? LLMs are notoriously bad at grading their own output.