Hacker News new | ask | show | jobs
by WillAdams 1030 days ago
What I really want to see is a tool which will open a scan of a check/invoice and review it to find:

- Invoice number --- it should resave the pixel image using that number as the file name - check amount and date --- this could be written out as a meta-file using the invoice number as a file name

that would let me automate pretty much the last aspect of my day job which is suited to automation which is not yet automated.

2 comments

Why not use existing OCR/document extraction tools [0]? There are a number of options, and even a custom implementation is probably a reasonable side project given some standardized structure.

[0]: https://rossum.ai/lp/data-extraction

The structure isn't standardized --- it's a random check design placed on top of an invoice which may be printed from a wide variety of printers at some random scale, and possibly photocopied multiple times.
Extracting data from PDFs can be done with Bing AI if you open it on Edge. It can’t operate on the files (save) though.