Hacker News new | ask | show | jobs
by malfist 3 hours ago
Pretty decent might be quiet the stretch. I'd term it almost acceptable, but only if you're using commercial solutions like amazon's textract, doing it with open source tools is at best, extremely painful and vaguely accurate.
1 comments

PaddleOCR (also from Baidu) is pretty damn good actually.
I have shipped with PaddleOCR to prod. Works pretty well. (Usage limited to printed documents in Anglosphere). Runs fully offline, in CPU.