Hacker News new | ask | show | jobs
by 8note 176 days ago
run some ocr on them after to recreate the text layer?
1 comments

With the aggressive push of LLMs and Generative AI ..i am expecting a lot of OCR features to become "smarter" by default, namely go beyond mechanical OCR and start inserting hallucinations and sematically/contextually "more correct" information in OCR output

It's not hard to imagine some powerful LLMs being able to undo some light redactions that are deducible based on context

Or worse, making up names or information instead of writing the reaction.