Hacker News new | ask | show | jobs
by UltraSane 66 days ago
Modern OCR is VERY accurate. Heck Adobe Acrobat Pro OCR was essentially perfect 20 years ago.
2 comments

One of my hobbies is typesetting modern editions of a certain type of rare, obscure old books that were poorly typeset to begin with. Modern OCR—and I’ve tried plenty of tools—is still rather error prone in my application.
Can you name a good open source one? I have spent many hours in the current decade correcting OCR errors. Mostly tesseract.