Hacker News new | ask | show | jobs
by dotsam 900 days ago
I have played around with the OCR on my mac, and have been very impressed. It has been consistently better than tesseract for my purposes.

However, when creating a PDF from images using Preview and exporting using ‘Embed Text’ option to OCR, I have noticed the text is worse than if you OCR the exact same images using the shortcut above or using a script. Presumably Preview is using the Vision framework’s less accurate fast path when preparing the PDF. https://developer.apple.com/documentation/vision/recognizing...