Hacker News new | ask | show | jobs
by kkfx 790 days ago
If you didn't now it, give ocrmypdf (python/tesseract wrapper) a try, all you need is `ocrmypdf in.pdf out.pdf`. It's not super-perfect but works well enough in 99% of common cases.