by ocrcustomserver 2761 days ago
This is very interesting. I'm curious to see how they will execute on several points:

1. How it will deal with multiple templates that the system hasn't seen before, especially when the templates differ significantly from one another.

2. UI/UX, e.g. how it will trace the extracted data back to the original source and how it will show the confidence score of each entity.

3. Verification process: what will the workflow look like when the confidence score is low and the document has to be checked by human operators?