by jumpskiphop 2400 days ago
The problem with extracting information isn't limited to getting OCR results. The bigger problem when building something like this is extracting the fields and understanding the structure of the document automatically. With Python OCR libraries you'd probably get text results for a driver's license and a passport separately, then process each with rules written for that document type. With deep learning, a non-template solution seems possible: one that figures out which ID it is, finds where the name, address, and relevant numbers are, and puts them into a structure.
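To make the "separate rules for each" approach concrete, here's a rough sketch of what it tends to look like. Everything in it is hypothetical: the `RULES` table, the `extract_fields` helper, and the label patterns are made up for illustration and assume the OCR step already returned plain text with one labeled field per line.

```python
import re

# Hypothetical per-document rule sets. Real documents vary by issuer,
# so these label patterns are illustrative only.
RULES = {
    "drivers_license": {
        "name": re.compile(r"^NAME[:\s]+(.+)$", re.M),
        "number": re.compile(r"^DL[:\s]+(\w+)$", re.M),
    },
    "passport": {
        "name": re.compile(r"^Surname[:\s]+(.+)$", re.M),
        "number": re.compile(r"^Passport No[.:\s]+(\w+)$", re.M),
    },
}


def extract_fields(doc_type: str, ocr_text: str) -> dict:
    """Apply the rule set for one known document type to raw OCR text."""
    fields = {}
    for field, pattern in RULES[doc_type].items():
        match = pattern.search(ocr_text)
        if match:
            fields[field] = match.group(1).strip()
    return fields
```

Note that the caller has to know `doc_type` up front and every new layout needs its own rule set, which is exactly the part a non-template model would have to learn.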
1 comment

In the case of U.S. driver's licenses, there are standards for the 2D barcode that make it very straightforward to parse: https://www.aamva.org/uploadedFiles/MainSite/Content/Solutio...
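As a minimal sketch of why the barcode route is simpler: once the PDF417 barcode has been decoded to a string (by a separate barcode reader, not shown), the payload is a list of elements, each starting with a three-letter ID. The helper name `parse_aamva_elements` and the sample payload are made up, the header handling is heavily simplified, and only a small subset of element IDs is mapped, so treat this as an illustration and not a conforming parser.

```python
# A small subset of AAMVA data element IDs mapped to readable names.
FIELD_NAMES = {
    "DCS": "family_name",
    "DAC": "first_name",
    "DAQ": "license_number",
    "DBB": "date_of_birth",
    "DBA": "expiration_date",
    "DAG": "street",
    "DAI": "city",
    "DAJ": "state",
    "DAK": "postal_code",
}


def parse_aamva_elements(decoded: str) -> dict:
    """Pull known elements out of an already-decoded barcode payload."""
    fields = {}
    for line in decoded.replace("\r", "\n").split("\n"):
        line = line.strip()
        # Assumption: the first element may directly follow the "DL"
        # subfile designator on the same line, e.g. "DLDAQ...".
        if line.startswith("DL") and line[2:5] in FIELD_NAMES:
            line = line[2:]
        code, value = line[:3], line[3:].strip()
        if code in FIELD_NAMES and value:
            fields[FIELD_NAMES[code]] = value
    return fields


# Made-up sample payload with the file header omitted.
SAMPLE = "DLDAQD1234567\nDCSSAMPLE\nDACJANE\nDBB01311990\nDAJCA\n"
print(parse_aamva_elements(SAMPLE))
```

Dates are left as raw strings here, since the date layout depends on the issuing jurisdiction and the version of the standard.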