by budududuroiu 712 days ago
Business card scanners have been around since the earliest versions of the iPhone, but I guess thank you, ChatGPT, for discovering OCR.
2 comments

Are there any good, state-of-the-art OCR packages for general-purpose transcription (e.g. give it a business card and have it format the details for you, give it a comic and have it transcribe it, give it nutritional info and have it put that in a table)? When I looked recently, GPT-4o was pretty much the best API I could find.
Do you have a link to one where I can put 50 cards on the floor, take a picture, and have it send me back an Excel file? I'd like to test it against ChatGPT, as I'm going to be implementing "AI" across the whole 700+ person business.
lol good luck doing that with GPT. I can tell you right now that you’ll have missing, malformed, or incorrect data, and it will be faster to just pass each card individually through a rudimentary scanner than to sit and figure out which entries are correct and which are wrong from the 50-card picture.
You're right. I tested it extensively before we designed a training program: it works very well up to about 10 or 11 cards, but past that it gets lost in the sauce. So pardon me: it only works out to a 6.45-hour savings, not a 6-hour savings.