Hacker News new | ask | show | jobs
by soVeryTired 248 days ago
Have you used PaddleOCR? I'm surprised they're claiming SOTA without comparing against Amazon Textract or Azure doc intelligence (LayoutLM v3 under the hood, as far as I know).

I've played around with doc recognition quite a bit, and as far as I can tell those two are best-in-class.

1 comments

Amazon textract is not great at multi colum layouts in my experience. Docupanda or some azure models beat it. Just my 2 cents.