Hacker News new | ask | show | jobs
by xophishox 3290 days ago
Give me an api end point to send an image to, and a text response. Ill hand you cash.
5 comments

There are a ton of these now. Google provides OCR as part of their machine vision API. AWS has similar with Rekognition. As others have mentioned, there are dozens of others on less well known platforms.
Actually, based on my tests, there are only a few good services:

Abbyy (best recognition rate but by far most expensive), Google Cloud Vision (second best recognition rate), Microsoft OCR and... our OCR.space service with a very generous free tier and a competitive priced PRO tier.

Rekognition from Amazon doesnt have OCR as far as I remember
Under the hood, the extension is using:

https://ocr.space/ocrapi

From the creators of Copyfish: https://ocr.space/

They should have an API to point to. It is fairly accurate. I use them occasionally via ShareX, which uses their API for OCR.

E: https://ocr.space/ocrapi

Like a9t9 said, ABBYY, Microsoft and Google offer this.

If your images however differ from the typical text document, recognition from those services will fail. OCR is highly dependent on the particular application and the kind of images that you're dealing with. Preprocessing and segmentation are very important.

If you need a custom solution, my email is in my profile.

Same