Hacker News new | ask | show | jobs
by alexcnwy 2403 days ago
This is a decent OCR / structured data extraction literature review but is absolutely not "building an ID card reader from scratch with deep learning".

It's also very hand-wavey on the details of how to actually use graph convolutional networks to extract structured ID card data. For example what "bounding box information" is used in your node representations? What is the architecture of your biLSTM?

This seems very much more like a promotion for your API than useful information on how to build a system that extracts data from ID cards.

1 comments

Sure, the blog might have missed out on the finer details of the different architectures. We intended to give an overview of some of the techniques used to build such information extraction models, we will definitely dive deeper into one of the architectures/model as a second part to this blog.