| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dotsam 900 days ago
	I have played around with the OCR on my mac, and have been very impressed. It has been consistently better than tesseract for my purposes. However, when creating a PDF from images using Preview and exporting using ‘Embed Text’ option to OCR, I have noticed the text is worse than if you OCR the exact same images using the shortcut above or using a script. Presumably Preview is using the Vision framework’s less accurate fast path when preparing the PDF. https://developer.apple.com/documentation/vision/recognizing...