Hacker News new | ask | show | jobs
by kumartanmay 2407 days ago
It depends from one form to other and sensitivity of data. Our company has developed a product to transform data from pdfs (electronic & non-machine readable files) for financial services. To meet the needs of customers, we have developed parsers for bank statements, loan/lease agreements, valuation reports and portfolio analysis. There are 20% of the documents that are scanned and require OCRs to be trained for different fonts. We wrote about challenges with currents OCRs: https://medium.com/zodhana/modern-day-challenges-with-docume...
1 comments

Usually, OCRs are the goto solution for scanned images, here's why it is not good for extracting financial data with precision: https://medium.com/zodhana/why-ocr-ing-a-bank-statement-is-a...