Y
Hacker News
new
|
ask
|
show
|
jobs
by
eihli
1728 days ago
Nice. I worked on something similar but far less robust:
https://github.com/eihli/image-table-ocr
. It fails to find the tables on the example images at extract-table.com, but the code is heavily commented at
https://eihli.github.io/image-table-ocr/pdf_table_extraction...
so there's high visibility into what's going on and what needs to change to get it to work with images of different sizes/fonts.