by smusamashah 308 days ago
Any tool that takes a scanned PDF, then overlays OCRed text over the scan so that the text becomes searchable?
1 comments

https://github.com/ocrmypdf/OCRmyPDF

>OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

I ... I nailed it.

Just a note that OCRmyPDF currently uses Tesseract
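
For anyone who wants to script this, here is a minimal sketch in Python that shells out to the `ocrmypdf` command. It assumes the `ocrmypdf` CLI and Tesseract are already installed and on your PATH. The helper names (`build_ocrmypdf_command`, `add_text_layer`) and the file names are made up for illustration; only the `ocrmypdf -l LANG input.pdf output.pdf` invocation comes from the tool itself.

```python
import shutil
import subprocess


def build_ocrmypdf_command(src, dst, language="eng"):
    """Build the argument list for: ocrmypdf -l LANG input.pdf output.pdf

    Hypothetical helper; `language` is a Tesseract language code such as "eng".
    """
    return ["ocrmypdf", "-l", language, str(src), str(dst)]


def add_text_layer(src, dst, language="eng"):
    """Run ocrmypdf so that `dst` is a copy of `src` with a searchable text layer.

    Hypothetical wrapper; raises if the ocrmypdf executable cannot be found
    or if the command exits with a non-zero status.
    """
    if shutil.which("ocrmypdf") is None:
        raise RuntimeError("ocrmypdf was not found on PATH")
    subprocess.run(build_ocrmypdf_command(src, dst, language), check=True)


if __name__ == "__main__":
    # Placeholder file names; substitute your own scan.
    add_text_layer("scan.pdf", "scan_searchable.pdf")
```

Building the command as a list instead of a shell string means file names with spaces don't need quoting.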